TITLE OF THE INVENTION

CELL ABLATION USING 
TRANS-SPLICING RIBOZYMES 

Field of the Invention 

The present invention is directed to novel trans-splicing
ribozymes and methods of cell ablation using these
ribozymes.

BRIEF DESCRIPTION OF THE BACKGROUND ART 

I. Group I Introns

RNA molecules with catalytic activity are called
ribozymes or RNA enzymes (Cech, T.R., Ann. Rev.
Biochem. 59:543-568 (1990)). The Tetrahymena thermophila
precursor rRNA contains an intron (a ribozyme) capable
of catalyzing its own excision. This ribozyme is one
of a class of structurally related Group I introns.

The splicing activity of the modified T.
thermophila intron requires the presence of a guanosine
cofactor and a divalent cation, either Mg++ or Mn++,
and occurs via two sequential transesterification
reactions (Figure 1). First, a free guanosine is
bound to the ribozyme and its 3' hydroxyl group is
positioned to attack the phosphorus atom at the 5'
splice site. The guanosine is covalently attached to
the intron sequence and the 5' exon is released.
Second, the phosphodiester bond located at the 3'
splice site undergoes attack from the newly freed 3'
hydroxyl group of the 5' exon, resulting in production
of the ligated exon sequences. The excised intron
subsequently undergoes a series of transesterification
reactions, involving its 3' hydroxyl group and internal
sequences, resulting in the formation of shortened
circular forms.
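
The two transesterification steps described above can be followed on
plain strings. The sketch below is only a bookkeeping model, not a
chemical simulation: the function name, the index convention and the
short example sequences are assumptions made for illustration, and the
later circularization of the excised intron is not modeled.

```python
def self_splice(precursor: str, intron_start: int, intron_end: int):
    """Toy model of group I self-splicing on an RNA string.

    intron_start: 0-based index of the first intron base.
    intron_end:   0-based index one past the last intron base.
    Returns (ligated_exons, excised_intron).
    """
    exon5 = precursor[:intron_start]
    intron = precursor[intron_start:intron_end]
    exon3 = precursor[intron_end:]

    # Step 1: the free guanosine attacks the 5' splice site. The G becomes
    # attached to the 5' end of the intron and the 5' exon is released.
    g_intron_exon3 = "G" + intron + exon3

    # Step 2: the 3' hydroxyl of the released 5' exon attacks the 3' splice
    # site, joining the exons and freeing the G-tagged intron.
    excised_intron = g_intron_exon3[: 1 + len(intron)]
    ligated_exons = exon5 + g_intron_exon3[1 + len(intron):]
    return ligated_exons, excised_intron


# Hypothetical miniature precursor: 5' exon + intron + 3' exon.
ligated, excised = self_splice("CUCUCU" + "AAAUAGCAAUG" + "UAAGGUAGC", 6, 17)
print(ligated)   # exons joined, intron removed
print(excised)   # intron carrying the added 5' guanosine
```

The intron in the example ends in G to mirror the conserved guanosine
at the 3' intron boundary; the model itself does not enforce this.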

These successive reactions are chemically similar 
and appear to occur at a single active site. The 
reactions of self-splicing are characterized by the 
formation of alternative RNA structures as differing 
RNA chains are each brought to form similar
conformations around the highly conserved intron. 
Splicing requires the alignment of the intron-exon 
junctions across a complementary sequence termed the 
"internal guide sequence" or IGS. 

The first cleavage at the 5' splice site requires
the formation of a base-paired helix (P1) between the
IGS and sequences adjacent to the splice site. The
presence of a U:G "wobble" base-pair within this helix
defines the phosphodiester bond that will be broken in
the catalytic reaction of the ribozyme. After cleavage
of this bond, a portion of the P1 helix is displaced and
a new helix, P10, is formed due to complementarity
between the IGS and sequences adjacent to the 3' splice
site. An invariant guanosine residue precedes the
phosphodiester at the 3' splice site, similar to the
portion of the P1 sequence that it is displacing.
Thus, ligation of the exons occurs in a reverse of the
first cleavage reaction but where new exon sequences
have been substituted for those of the intron. It may
be noted that intron circularization reactions
subsequent to exon ligation also involve base-pairing
of 5' sequences across the IGS, and attack mediated by
the 3' hydroxyl group of the intron's terminal guanine
residue (Been, M.D. et al., "Selection Of
Circularization Sites In A Group I IVS RNA Requires
Multiple Alignments Of An Internal Template-Like
Sequence," Cell 50:951 (1987)).

II. Catalytic Activities 

In order to better define the structural and
catalytic properties of the Group I introns, exon
sequences have been stripped from the "core" of the T.
thermophila intron. Cech, T.R. et al., WO 88/04300,
describes at least three catalytic activities possessed
by the Tetrahymena intron ribozyme: (1) a
dephosphorylating activity, capable of removing the 3'
terminal phosphate of RNA in a sequence-specific
manner, (2) an RNA polymerase activity (nucleotidyl
transferase), capable of catalyzing the conversion of
oligoribonucleotides to polyribonucleotides, and (3) a
sequence-specific endoribonuclease activity.

Isolated ribozyme activities can interact with
substrate RNAs in trans, and these interactions have
been characterized. For example, when truncated forms of the
intron are incubated with sequences corresponding to
the 5' splice junction, the site undergoes guanosine-
dependent cleavage in mimicry of the first step in
splicing. The substrate and endoribonucleolytic intron
RNAs base-pair to form helix P1, and cleavage occurs
after a U:G base-pair at the 4th-6th position.
Phylogenetic comparisons and mutational analyses
indicate that the nature of the sequences immediately
adjacent to the conserved uracil residue at the 5' splice
site is unimportant for catalysis, provided the base-
pairing of helix P1 is maintained (Doudna, J.A. et al.,
Proc. Natl. Acad. Sci. USA 86:7402-7406 (1989)).
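
The P1 pairing rule described above (Watson-Crick pairs plus a U:G
wobble pair that marks the cleavage site) can be checked mechanically.
The following is a minimal sketch under stated assumptions: the function
names are hypothetical, the 4th-6th position window is taken from the
sentence above, and the example sequences (CUCUCU for the end of the 5'
exon, GGAGGG for the guide sequence) are used only as an illustration.

```python
WATSON_CRICK = {("A", "U"), ("U", "A"), ("G", "C"), ("C", "G")}
WOBBLE = {("G", "U"), ("U", "G")}


def p1_pairs(substrate: str, guide: str):
    """Pair substrate (5'->3') against guide (5'->3') in antiparallel.

    Returns a list of (substrate_base, guide_base, kind) tuples, or None
    if any position cannot form a Watson-Crick or wobble pair.
    """
    if len(substrate) != len(guide):
        raise ValueError("substrate and guide must have equal length")
    pairs = []
    for s, g in zip(substrate, reversed(guide)):
        if (s, g) in WATSON_CRICK:
            kind = "WC"
        elif (s, g) in WOBBLE:
            kind = "wobble"
        else:
            return None
        pairs.append((s, g, kind))
    return pairs


def candidate_cleavage_sites(substrate: str, guide: str, window=(4, 6)):
    """1-based substrate positions of a U paired with a guide G inside
    the given window; cleavage would follow such a position."""
    pairs = p1_pairs(substrate, guide)
    if pairs is None:
        return []
    low, high = window
    return [i for i, (s, g, _) in enumerate(pairs, start=1)
            if s == "U" and g == "G" and low <= i <= high]


sites = candidate_cleavage_sites("CUCUCU", "GGAGGG")
print(sites)
```

A substrate that breaks the helix (for example CUCUCA against the same
guide) yields no candidate site, which reflects the requirement that
base-pairing of helix P1 be maintained.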

The sequence requirements for 3' splice-site
selection appear to lie mainly within the structure of
the intron itself, including helix P9.0 and the
following guanosine residue which delineates the 3'
intron boundary. However, flanking sequences within the
3' exon are required for the formation of helix P10 and
efficient splicing, as shown by mutational analysis
(Sun, E.R. et al., Mol. Cell. Biol. 10:2960-2965
(1990)). In addition, oligonucleotides have been
ligated in trans, using a truncated form of the intron,
an "external" guide sequence and oligonucleotides
which had been extended by a 5' guanosine residue. The
substrate oligonucleotides corresponding to 3' exon
sequences were aligned solely by the formation of P10-
like helices on an external template, prior to ligation
(Doudna, J.A. et al., Nature 339:519-522 (1989)).

The cleavage activity of ribozymes has been
targeted to specific RNAs by engineering a discrete 
"hybridization" region into the ribozyme, such 
hybridization region being capable of specifically 
hybridizing with the desired RNA. For example, 
Gerlach, W.L. et al., EP 321,201, constructed a 
ribozyme containing a sequence complementary to a 
target RNA. Increasing the length of this 
complementary sequence increased the affinity of this 
sequence for the target. However, the hybridizing and
cleavage regions of this ribozyme were integral parts 
of each other. Upon hybridizing to the target RNA 
through the complementary regions, the catalytic region 
of the ribozyme cleaved the target. It was suggested 
that the ribozyme would be useful for the inactivation 
or cleavage of target RNA in vivo, such as for the 
treatment of human diseases characterized by the 
production of a foreign host's RNA. However, ribozyme-
directed trans-splicing (as opposed to trans-cleavage)
was not described or suggested. 

The endoribonuclease activities (the cleavage 
activities) of various naturally-occurring ribozymes 
have been extensively studied. Analysis of the 
structure and sequence of these ribozymes has indicated 
that certain nucleotides around the cleavage site are
highly conserved but flanking sequences are not so
conserved. This information has led to the design of
novel endoribonuclease activities not found in nature.
For example, Cech and others have constructed novel
ribozymes with altered substrate sequence specificity
(Cech, T.R. et al., WO 88/04300; Koizumi, M. et al.,
FEBS Lett. 228:228-230 (1988); Koizumi, M. et al., FEBS
Lett. 239:285-288 (1988); Haseloff, J. et al., Nature
334:585-591 (1988); and Heus, H.A. et al., Nucl. Acids
Res. 18:1103-1108 (1990)). From early studies of the
self-cleaving plant viroids and satellite RNAs
(Buzayan, J.M. et al., Proc. Natl. Acad. Sci. USA
83:8859-8862 (1986)), guidelines for the design of
ribozymes that are capable of cleaving other RNA
molecules in trans in a highly sequence-specific manner have
been developed (Haseloff, J. et al., Nature 334:585-591
(1988)). However, these constructs were unable to
catalyze efficient, targeted trans-splicing reactions.

The joining of exons contained on separate RNAs,
that is, trans-splicing, occurs in nature for both
snRNP-mediated and self-catalyzed group I and group II
introns. In trypanosome and Caenorhabditis elegans
mRNAs, common 5' leader sequences are transcribed from
separate genes and spliced to the 3' portions of the
mRNAs (Agabian, N., Cell 61:1157-1160 (1990); Hirsh, D.
et al., Mol. Biol. Rep. 14:115 (1990)). These small
"spliced leader" RNAs (slRNAs) consist of the 5' exon
fused to sequences that can functionally substitute for
U1 snRNA in mammalian snRNP-splicing extracts.

Also, both the group I and group II self-splicing
introns are capable of exon ligation in trans in
artificial systems (Been, M.D. et al., Cell 47:207-216
(1986); Galloway-Salvo, J.L. et al., J. Mol. Biol.
211:537-549 (1990); Jacquier, A. et al., Science
234:1099-1104 (1986); and Jarrell, K.A. et al., Mol.
Cell Biol. 8:2361-2366 (1988)). Trans-splicing occurs
in vivo for group II introns in split genes of
chloroplasts (Kohchi, T. et al., Nucl. Acids Res.
16:10025-10036 (1988)), and has been shown for a group
I intron in an artificially split gene in Escherichia
coli (Galloway-Salvo, J.L. et al., J. Mol. Biol.
211:537-549 (1990)). In the latter case, a
bacteriophage T4 thymidylate synthase gene (td)
containing a group I intron was divided at the loop
connecting the intron helix P6a. Transcripts of the td
gene segments were shown to undergo trans-splicing in
vitro, and to rescue dysfunctional E. coli host cells.
Known base-pairings (P3, P6 and P6a) and possible
tertiary interactions between the intron segments
allowed correct assembly and processing of the gene
halves.

In vitro, the Tetrahymena ribozyme is capable of
catalyzing the trans-splicing of single-stranded model
oligoribonucleotide substrates. Four components were
necessary: ribozyme, 3' single-stranded RNA, 5' exon
and GTP. A shortened form of the Tetrahymena ribozyme
(L-21 ScaI IVS RNA), starting at the internal guide
sequence and terminating at U409, has been used in such
a reaction (Flanegan, J.B. et al., J. Cell. Biochem.
(Supp.) 12 part D:28 (1988)). Attack by GTP at the 5'
splice site released the 5' exon which was then ligated
by the ribozyme to the 3' exon in a transesterification
reaction at the 3' splice site.

The in vivo use of ribozymes as an alternative to
the use of antisense RNA for the targeting and
destruction of specific RNAs has been proposed
(Gerlach, W.L. et al., EP 321,201; Cotten, M., Trends
Biotechnol. 8:174-178 (1990); Cotten, M. et al., EMBO
J. 8:3861-3866 (1989); Sarver, N. et al., Science
247:1222-1225 (1990)). For example, expression of a
ribozyme with catalytic endonucleolytic activity
towards an RNA expressed during HIV-1 infection has
been suggested as a potential therapy against human
immunodeficiency virus type 1 (HIV-1) infection
(Sarver, N. et al., Science 247:1222-1225 (1990);
Cooper, M., CDC AIDS Weekly, April 3, 1989, page 2;
Rossi, J.J., Abstract of Grant No. 1R01AI29329 in
Dialog's Federal Research in Progress File 265).
However, such attempts have not yet been successful.

In a study designed to investigate the potential
use of ribozymes as therapeutic agents in the treatment
of human immunodeficiency virus type 1 (HIV-1)
infection, ribozymes of the hammerhead motif (Hutchins,
C.J. et al., Nucl. Acids Res. 14:3627 (1986); Keese, P.
et al., in Viroids and Viroid-Like Pathogens, J.S.
Semancik, ed., CRC Press, Boca Raton, FL, 1987, pp. 1-
47) were targeted to the HIV-1 gag transcripts.
Expression of the gag-targeted ribozyme in human cell
cultures resulted in a decrease in (but not a complete
disappearance of) the level of HIV-1 gag RNA and in
antigen p24 levels (Sarver, N. et al., Science
247:1222-1225 (1990)). Thus, the medical effectiveness
of Sarver's ribozyme was limited by its low efficiency,
since any of the pathogen's RNA that escapes remains a
problem for the host.

Another problem with in vivo ribozyme applications
is that a high ribozyme to substrate ratio is required
for ribozyme inhibitory function in nuclear extracts
and it has been difficult to achieve such ratios.
Cotten et al. achieved a high ribozyme to substrate
ratio by microinjection of an expression cassette
containing a ribozyme-producing gene operably linked to
a strong tRNA promoter (a polymerase III promoter) in
frog oocytes, together with substrate RNA that contains
the cleavage sequence for the ribozyme (Cotten, M. et
al., EMBO J. 8:3861-3866 (1989)). However,
microinjection is not an appropriate method of delivery
in multicellular organisms.

The in vivo activity of ribozymes designed against
mRNA coding for Escherichia coli β-galactosidase has
been reported (Chuat, J.-C. et al., Biochem. Biophys.
Res. Commun. 162:1025-1029 (1989)). However, this
activity was only observed when the ribozyme and target
were transfected into bacterial cells on the same
molecule. Ribozyme activity was inefficient when
targeted against an mRNA transcribed from a bacterial
F episome that possessed the target part of the β-
galactosidase gene.

Thus, current technological applications of
ribozyme activities are limited to those which propose
to utilize a ribozyme's cleavage activity to destroy
the activity of a target RNA. Unfortunately, such
applications often require complete destruction of all
target RNA molecules, and/or relatively high
ribozyme:substrate ratios to ensure effectiveness, and
this has been difficult to achieve. Most importantly,
the modified ribozymes of the art are not capable of
efficient, directed trans-splicing.

Accordingly, a need exists for the development of 
highly efficient ribozymes and ribozyme expression 
systems. Especially, the art does not describe an 
effective means in which to destroy an existing RNA
sequence or to alter the coding sequence of an existing 
RNA by the trans-splicing of a new RNA sequence into a 
host's RNA. 

SUMMARY OF THE INVENTION

Recognizing the potential for the design of novel 
ribozymes, and cognizant of the need for highly 
efficient methods to alter the genetic characteristics 
of higher eukaryotes in vivo, the inventors have 
investigated the use of ribozymes to alter the genetic 
information of native RNAs in vivo. These efforts
have culminated in the development of highly effective
trans-splicing ribozymes, and guidelines for the
engineering thereof.

According to the invention, there is first 
provided an RNA or DNA molecule, such molecule encoding 
a trans-splicing ribozyme, such ribozyme being capable 
of efficiently splicing a new 3' exon sequence into any
chosen target RNA sequence in a highly precise manner,
in vitro or in vivo, and such molecule being novel in
the ability to accommodate any chosen target RNA or 3'
exon sequences, and in the addition of a complementary
sequence which enhances the specificity of such 
ribozyme. 

According to the invention, there is also provided 
an RNA or DNA molecule, such molecule encoding a 
ribozyme, the sequence for such ribozyme being a fusion 
RNA, such fusion RNA providing a first RNA sequence 
that is sufficient for targeting such ribozyme to 
hybridize to a target RNA, and further a second RNA 
sequence, such second RNA sequence capable of being 
transposed into the target RNA, and such second RNA
sequence encoding an RNA sequence foreign to the 
targeted RNA sequence. 

According to the invention, there is further 
provided an RNA or DNA molecule, such molecule encoding 
a ribozyme, the sequence for such ribozyme being a 
fusion RNA as described above, the first RNA sequence 
provided by the fusion RNA being a sequence for 
targeting such RNA molecule to hybridize to GAL4 RNA, 
and the second RNA sequence of the fusion RNA providing 
the coding sequence of the A chain of diphtheria toxin 
(DTA).

According to the invention, there is also provided 
an RNA or DNA molecule, such molecule encoding a 
conformationally disrupted ribozyme of the invention,
a pro-ribozyme, such pro-ribozyme being substrate-
activated, that is, such pro-ribozyme possessing
negligible or no self-cleavage or trans-splicing
activity, until being reactivated by specific interaction
with target RNA.

According to the invention, there is further 
provided an RNA or DNA molecule containing a ribozyme 
or pro-ribozyme expression cassette, such cassette 
being capable of being stably maintained in a host, or 
inserted into the genome of a host, and such cassette 
providing the sequence of a promoter capable of func- 
tioning in such host, operably linked to the sequence 
of a ribozyme or pro-ribozyme of the invention. 

According to the invention, there is further 
provided an RNA or DNA molecule containing a ribozyme 
or pro-ribozyme expression cassette, such cassette 
being capable of being stably inserted into the genome 
of a host, such ribozyme expression cassette providing 
the sequence of a GAL4-responsive promoter operably
linked to the sequence of a ribozyme or pro-ribozyme of 
the invention. 

According to the invention, there is further
provided a method for in vitro trans-splicing, such
method comprising the steps of (1) providing a ribozyme 
or pro-ribozyme of the invention and an appropriate 
substrate for such ribozyme in vitro, (2) further 
providing in vitro reaction conditions that promote the 
desired catalytic activity of such ribozyme or pro- 
ribozyme; and (3) allowing such ribozyme or pro- 
ribozyme to react with such substrate under such 
conditions. 

According to the invention, there is further
provided a method for in vivo trans-splicing, such
method comprising the steps of (1) providing an RNA or
DNA molecule of the invention to a host cell, (2)
expressing the ribozyme or pro-ribozyme encoded by such
molecule in such host cell, (3) expressing a substrate
of such ribozyme or pro-ribozyme in such host cell, and
(4) allowing such ribozyme or pro-ribozyme to react
with such substrate in such host cell.

According to the invention, there is further 
provided a method for inactivating the activity of a 
target RNA, such method comprising (1) providing a 
ribozyme or pro-ribozyme of the invention, such
ribozyme or pro-ribozyme being catalytically active
against such target RNA, (2) providing such target RNA, 
and (3) providing conditions that allow such ribozyme 
or pro-ribozyme to express its catalytic activity 
towards such target RNA. 

According to the invention, there is further 
provided a method for providing a desired genetic 
sequence to a host cell in vivo, such method comprising 
(1) providing a ribozyme or pro-ribozyme of the 
invention to a desired host cell, such ribozyme or pro- 
ribozyme being catalytically active against a target 
RNA in such host cell, (2) providing such ribozyme or 
pro-ribozyme encoding such desired genetic sequence, 
and (3) providing conditions that allow such ribozyme
or pro-ribozyme to trans-splice such desired genetic 
sequence into the sequence of the target RNA. 

According to the invention, there is further 
provided a method for cell ablation in multicellular 
plants and animals, such method comprising providing a 
ribozyme or pro-ribozyme of the invention to any host
cell, and especially into a fertilized embryonic host 
cell, such ribozyme or pro-ribozyme encoding the 
sequence of a gene toxic to such host cell and such 
ribozyme or pro-ribozyme being capable of trans- 
splicing with a desired target in such host cell. 

According to the invention, there is further 
provided a method for engineering male or female 
sterility in agronomically important plant species, 
such method comprising the ablation of any cell
necessary for fertility using a ribozyme or pro-
ribozyme of the invention.

According to the invention, there is further
provided a method of immunizing plants against plant
pathogens, such method comprising the construction of
transgenic plants capable of expressing a plant
pathogen-specific fusion ribozyme or pro-ribozyme of
the invention, and such ribozyme or pro-ribozyme being
capable of ablating any host cell infected with such
pathogen.

According to the invention, there is further 
provided a transformed, pathogen-resistant 
microorganism, such microorganism being resistant to a 
desired pathogen, such microorganism being transformed 
with a ribozyme or pro-ribozyme of the invention and 
such ribozyme or pro-ribozyme providing a catalytic 
activity that targets a nucleic acid molecule expressed
by such pathogen.

According to the invention, there is further
provided a viral pathogen capable of delivering a
desired ribozyme or pro-ribozyme activity to a desired
host, such ribozyme or pro-ribozyme activity being
delivered by a ribozyme or pro-ribozyme of the
invention.

DESCRIPTION OF THE FIGURES

Figure 1 is a diagram of the mechanism of ribozyme 
splicing of the group I intron. 

Figure 2 is a diagram of the structure of (A) the
Tetrahymena thermophila rRNA intron; (B) target mRNA
and trans-splicing ribozyme or pro-ribozyme of the
invention.

Figure 3(A) is a diagram of the design of a CAT-
LacZ α-peptide trans-splicing ribozyme; (B) is the
complete DNA coding sequence of the CAT-LacZ ribozyme.

Figure 4 presents the sequences of cucumber mosaic
virus (CMV) RNA 4 trans-splicing ribozymes. A: virus
RNA target sequences; B: oligonucleotide target
sequences; C: CMV RNA 4 - diphtheria toxin A-chain
trans-splicing ribozymes.

Figure 5 is a comparison of cucumber mosaic virus 
3/4 sequences. 

Figure 6(A) is a diagram of the design of a Gal4-
Diphtheria toxin A (DTA) trans-splicing ribozyme; (B)
is the complete coding sequence of the Gal4-DTA
ribozyme with the isoleucine substitution.

Figure 7 is a diagram of the P-element mediated 
"enhancer-trapping" method for expression of Gal4 
protein. 

Figure 8 presents a partial sequence of wild-type
DTA and DTA 3' exon mutants.

Figure 9 is a map of pGaTB and pGaTN.

Figure 10 is a map of pUAST.

Figure 11 is a cuticle preparation of Drosophila
embryos expressing a Gal4-DTA trans-splicing ribozyme.

Figure 12 presents the rationale for "pro-ribozyme"
design. Arrows show sites of ribozyme
cleavage, "antisense" regions are shown in black,
catalytic domains are shown with radial shading, and 3'
"exon" sequences are shown with light shading. In the
absence of the target mRNA, trans-splicing ribozymes
may transiently base-pair, and react with heterologous
sequences (including their own). In addition, scission
at the "3' exon" junction will occur. Inactive "pro-
ribozymes" are constructed to contain extra self-
complementary sequences which cause the catalytic
center of the ribozyme to be mis-folded. Active
ribozymes are only formed after base-pairing with the
intended target mRNA - and consequent displacement of
the interfering secondary structure.

Figure 13 shows the sequence and predicted
secondary structure of the CAT-LacZ trans-splicing
ribozyme. Ribozyme "core" sequences are shaded (after
Cech, Gene 73:259-271 (1988)). Helices P8 are shown
for the unmodified ribozyme and pro-ribozymes 1 and 2,
with 13 and 18 nucleotides, respectively, of sequence
complementary to the "antisense" region (highlighted).

Figure 14 shows (1) active CAT-LacZ ribozyme shown
schematically, with "antisense", ribozyme domain with
helix P8 and 3' "exon" sequences; (2) (a) inactive CAT-
LacZ pro-ribozyme 2 shown with base-pairing between
sequences in the modified helix P8 and the "anti-sense"
region; and (b) the active pro-ribozyme, after base-
pairing with the CAT mRNA, displacement of the helix P8
- "antisense" pairing, and re-formation of helix P8.

Figure 15 shows stability of CAT-LacZ pro-ribozyme
transcripts. Plasmids containing the CAT-LacZ ribozyme
and pro-ribozyme sequences were cleaved with EcoRI and
transcribed using T7 or SP6 RNA polymerase and
[32P]UTP. Radiolabeled transcripts were fractionated by
5% polyacrylamide gel electrophoresis in 7M urea and
25% formamide, and autoradiographed. The ribozyme
transcripts underwent extensive hydrolysis, primarily
at the "3' exon" junction. The pro-ribozyme forms were
markedly less reactive.

Figure 16 shows endoribonuclease activity of CAT-
LacZ pro-ribozymes. Plasmids containing CAT-LacZ
ribozyme and pro-ribozyme sequences were cleaved with
ScaI, and transcribed with T7 or SP6 RNA polymerase.
Transcripts were incubated for 30 minutes at 37°C, 45°C and
50°C in 40 mM Tris-HCl pH 7.5, 6 mM MgCl2, 2 mM
spermidine, 10 mM NaCl, 2 mM GTP with radiolabeled CAT
RNA, transcribed using T7 RNA polymerase from plasmid
cut with PvuII. Products were fractionated by 5%
polyacrylamide gel electrophoresis in 7M urea and 25%
formamide, and autoradiographed. RNA-mediated cleavage
of the 173 nt (nucleotides) CAT RNA produces 5' and 3'
fragments of 76 nt and 97 nt, respectively.

Figure 17 shows the "wild-type" and modified
helices P8 used for pro-ribozyme design with possible 
base-pairs indicated in schematic form. Those bases 
which are complementary to the "anti-sense" portion of 
the corresponding pro-ribozyme, are shown in bold type. 
The number of complementary bases is listed next to 
each helix. The helices are ordered by the stability 
of the corresponding pro-ribozyme transcripts, as 
measured by the degree of "3' exon" hydrolysis during 
in vitro transcription. 

Figure 18 shows the stability of GAL4-DTA pro-
ribozymes. Plasmids containing ribozyme and pro-
ribozyme sequences were linearized with XhoI and
transcribed using T7 RNA polymerase. Transcripts were
incubated for 60 minutes at 50°C in 40 mM Tris-HCl pH 7.5, 6 mM
MgCl2, 2 mM spermidine, 10 mM NaCl, 1 mM GTP,
fractionated by 5% polyacrylamide gel electrophoresis
in 7M urea and 25% formamide, and autoradiographed.
Ribozyme transcripts are extensively hydrolysed under
these conditions, while pro-ribozyme 1 is less so and
pro-ribozyme 2 is stable.

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS

I. Definitions

In the description that follows, a number of terms 
used in recombinant DNA (rDNA) technology are 
extensively utilized. In order to provide a clear and 
consistent understanding of the specification and 
claims, including the scope to be given such terms, the 
following definitions are provided. 

Ribozyme. An RNA molecule that inherently
possesses catalytic activity.

Trans-splice. A form of genetic manipulation
whereby a nucleic acid sequence of a first
polynucleotide is co-linearly linked to or inserted
into the sequence of a second polynucleotide, in a
manner that retains the 3'→5' phosphodiester linkage
between such polynucleotides. By "directed" trans-
splicing or "substrate-specific" trans-splicing is
meant a trans-splicing reaction that requires a
specific species of RNA as a substrate for the trans-
splicing reaction (that is, a specific species of RNA in
which to splice the transposed sequence). Directed
trans-splicing may target more than one RNA species if
the ribozyme or pro-ribozyme is designed to be directed
against a target sequence present in a related set of
RNAs.

Target RNA. An RNA molecule that is a substrate
for the catalytic activity of a ribozyme or pro-
ribozyme of the invention.

Expression Cassette. A genetic sequence that
provides sequences necessary for the expression of a
ribozyme or pro-ribozyme of the invention.

Stably. By "stably" inserting a sequence into a
genome is intended insertion in a manner that results
in inheritance of such sequence in copies of such
genome.

Operable linkage. An "operable linkage" is a
linkage in which a sequence is connected to another
sequence (or sequences) in such a way as to be capable
of altering the functioning of the sequence (or
sequences). For example, by operably linking a
ribozyme or pro-ribozyme encoding sequence to a
promoter, expression of the ribozyme or pro-ribozyme
encoding sequence is placed under the influence or
control of that promoter. Two nucleic acid sequences
(such as a ribozyme or pro-ribozyme encoding sequence
and a promoter region sequence at the 5' end of the
encoding sequence) are said to be operably linked if
induction of promoter function results in the
transcription of the ribozyme or pro-ribozyme encoding
sequence and if the nature of the linkage between the
two sequences does not (1) result in the introduction
of a frame-shift mutation, or (2) interfere with the
ability of the expression regulatory sequences to
direct the expression of the ribozyme. Thus, a
promoter region would be operably linked to a nucleic
acid sequence if the promoter were capable of effecting
the synthesis of that nucleic acid sequence.

II. Engineering of the Ribozyme of the Invention

The trans-splicing ribozymes, pro-ribozymes and
methods of the invention provide, for the first time,
a ribozyme capable of directed trans-splicing into any
RNA sequence, and especially into mature (non-intron-
containing) mRNA. The trans-splicing ribozyme as
described herein, with its extended complementarity to
the target, greatly differs from T. thermophila-derived
endoribonuclease activities described in the art. The
additional complementarity of the ribozymes of the
invention confers increased affinity and specificity
for the target and the complementarity is not an
integral part of the catalytic activity. In addition,
cleavage occurs efficiently and precisely in the
absence of denaturants and at high concentrations of
Mg++.

The guidelines described herein for the design of
trans-splicing ribozymes are conservative, based on the
well characterized properties of group I self-splicing
introns and are meant to provide a general scheme for
the design of any directed trans-splicing ribozyme.
Accordingly, the guidelines presented herein are not
limited to the group I intron of the T. thermophila
pre-mRNA and may be used by one of skill in the art to
design a ribozyme of the invention with other group I
introns using such guidelines and knowledge in the art.

The native T. thermophila ribozyme (the intron 
sequence) is located from base 53 to base 465 in the 
sequence below of the T. thermophila extrachromosomal 
rDNA: 

TGACGCAATT CAACCAAGCG CGGGTAAACG GCGGGAGTAA CTATGACTCT 
CTAAATAGCA ATATTTACCT TTGGAGGGAA AAGTTATCAG GCATGCACCT 
CCTAGCTAGT CTTTAAACCA ATAGATTGCA TCGGTTTAAA AGGCAAGACC 
GTCAAATTGC GGGAAAGGGG TCAACAGCCG TTCAGTACCA AGTCTCAGGG 
GAAACTTTGA CATGGCCTTG CAAAGGGTAT GGTAATAAGC TGACGGACAT
GGTCCTAACC ACGCAGCCAA GTCCTAAGTC AACAGATCTT CTGTTGATAT
GGATGCAGTT CACAGACTAA ATGTCGGTCG GGGAAGATGT ATTCTTCTCA
TAAGATATAG TCGGACCTCT CCTTAATGGG AGGTAGCGGA TGAATGGATG
CAACACTGGA GCCGCTGGGA ACTAATTTGT ATGCGAAAGT ATATTGATTA
GTTTTGGAGT ACTCGTAAGG TAGCCAAATG CCTCGTCATC TAATTAGTGA
CGCGCATGAA TGGATTA [SEQ ID NO. 1]

(Kan, N.C. et al., Nucl. Acids Res. 10:2809-2822
(1982)).

As described herein, the directed trans-splicing
ribozymes of the invention are engineered using the
catalytic core of this intron. The intron and its
catalytic core can be isolated by methods known in the
art. The catalytic core of the intron, that is, the
truncated intron, differs from the full-length intron
only in that it is truncated at the ScaI site, thus
removing the last five nucleotides of the intron. The
truncated intron RNA may be prepared by techniques
known in the art or may be purchased commercially in
kit form from commercial sources such as, for example,
product #72000 from US Biochemical, Cleveland, OH
(RNAzyme™ Tet 1.0 Kit). This US Biochemical kit
provides ribozyme and the protocol for the use of the
ribozyme. Transcribed Tet.1 cDNA may be used as the
substrate for polymerase chain reaction (PCR)
mutagenesis as described below, to produce a synthetic
trans-splicing enzyme.
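
As a hedged illustration of the coordinates given above, the following Python sketch slices the intron (bases 53 to 465) out of the rDNA sequence as printed in this text and truncates it at the ScaI site. The variable and function names are illustrative assumptions, the sequence is copied from the text without independent verification, and ScaI is assumed to cut its AGTACT recognition sequence bluntly between AGT and ACT.

```python
# rDNA sequence as printed above (SEQ ID NO. 1); whitespace is stripped below.
RDNA = "".join("""
TGACGCAATT CAACCAAGCG CGGGTAAACG GCGGGAGTAA CTATGACTCT
CTAAATAGCA ATATTTACCT TTGGAGGGAA AAGTTATCAG GCATGCACCT
CCTAGCTAGT CTTTAAACCA ATAGATTGCA TCGGTTTAAA AGGCAAGACC
GTCAAATTGC GGGAAAGGGG TCAACAGCCG TTCAGTACCA AGTCTCAGGG
GAAACTTTGA CATGGCCTTG CAAAGGGTAT GGTAATAAGC TGACGGACAT
GGTCCTAACC ACGCAGCCAA GTCCTAAGTC AACAGATCTT CTGTTGATAT
GGATGCAGTT CACAGACTAA ATGTCGGTCG GGGAAGATGT ATTCTTCTCA
TAAGATATAG TCGGACCTCT CCTTAATGGG AGGTAGCGGA TGAATGGATG
CAACACTGGA GCCGCTGGGA ACTAATTTGT ATGCGAAAGT ATATTGATTA
GTTTTGGAGT ACTCGTAAGG TAGCCAAATG CCTCGTCATC TAATTAGTGA
CGCGCATGAA TGGATTA
""".split())

INTRON_START, INTRON_END = 53, 465  # 1-based, inclusive, as stated in the text
SCAI_SITE = "AGTACT"                # assumed blunt cut: AGT^ACT


def extract_intron(rdna: str) -> str:
    """Return the intron using the 1-based inclusive coordinates above."""
    return rdna[INTRON_START - 1:INTRON_END]


def truncate_at_scai(intron: str) -> str:
    """Keep everything 5' of the ScaI cut (the last site found in the intron)."""
    site = intron.rfind(SCAI_SITE)
    if site < 0:
        raise ValueError("no ScaI site found in the intron")
    return intron[:site + 3]


intron = extract_intron(RDNA)
core = truncate_at_scai(intron)
# Prints the intron length, the truncated length, and the removed 3' tail.
print(len(intron), len(core), intron[len(core):])
```

With the sequence as printed, the removed tail is five nucleotides long, which agrees with the statement that truncation at the ScaI site removes the last five nucleotides of the intron.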

Substrate specificity of the ribozyme of the 
invention, that is, the ability of the ribozyme to 
"target" a specific RNA as a substrate, is conferred by 
fusing complementary sequences specific to the target 
(substrate) RNA to the 5' terminus of the ribozyme.

Directed trans-splicing specificity of the 
ribozyme of the invention, that is, specificity in 
trans-splicing a desired foreign sequence of interest 
with the sequence of a target RNA, is conferred by 
providing a new 3' exon at the 3' terminus of the
ribozyme. Details of the design are further provided 
below. 

To alter the structural and catalytic properties 
of the Group I introns, exon sequences replace the 
flanking sequences of such introns so that only the
catalytic core of the intron, the ribozyme, remains.
The resulting modified ribozyme can interact with
substrate RNAs in trans. When truncated forms of the
intron (i.e., the catalytic "core," i.e., truncated at
the ScaI site, removing the last five nucleotides of
the intron) are incubated with sequences corresponding 
to the 5' splice junction of the native ribozyme, the 
site undergoes guanosine-dependent cleavage in mimicry 
of the first step in splicing. 

Engineering of the ribozymes of the invention
requires consideration of the four guidelines that
follow.

First, a splice site must be chosen within the
target RNA. In the final trans-splicing complex, only
the 5' portion of the P1 duplex is contributed by the
target RNA. Only a single conserved residue, uracil,
is required immediately 5' of the intended splice site.
This is the sole sequence requirement in the target
RNA. There is no innate structure required of the target
RNA. Mature mRNA may be targeted and the trans-
splicing reaction performed in the cell's cytoplasm
rather than in the nucleus against pre-mRNA. This
obviates the need for high concentrations of ribozyme
in a cell's nucleus.
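
Because the only sequence requirement in the target is a uracil immediately 5' of the splice site, candidate sites can be listed mechanically. The following Python sketch is a minimal illustration under that single rule; the function name and the convention of reporting a site as the length of the 5' fragment are assumptions made here for illustration.

```python
def candidate_splice_sites(target_rna: str) -> list[int]:
    """List every splice position p whose 5' neighbour is a uracil.

    p is reported as the number of target nucleotides lying 5' of the
    splice site, so target_rna[p - 1] is the required uracil.
    DNA-style input (T instead of U) is accepted and converted.
    """
    rna = target_rna.upper().replace("T", "U")
    return [i + 1 for i, base in enumerate(rna) if base == "U"]


# Uracils sit at the 3rd and 5th residues of this toy sequence.
print(candidate_splice_sites("GAUCUA"))  # [3, 5]
```

In practice the list would be narrowed using the remaining guidelines, for example by the position of the site relative to the coding region.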

Second, having chosen a particular target
sequence, compensating sequence changes must be added
to the 5' section of the ribozyme in order to allow the
formation of a suitable helix P1 between the target and
ribozyme RNAs. It is highly desired that the helix
P1 contain a U:G base-pair at the intended 5' splice
site, and that this base-pair be positioned at the 4th, 5th
(preferred) or 6th position from the base of the helix
(Doudna, J.A., et al., "RNA Structure, Not Sequence
Determines The 5' Splice-Site Specificity of a Group I
Intron," Proc. Natl. Acad. Sci. USA 86:7402-7406
(1989), incorporated herein by reference). For the
native T. thermophila intron, P1 extends for an
additional 3 base pairs past the intended 5' splice
site, and, in a preferred embodiment, this is
maintained in the trans-splicing ribozyme of the
invention. For trans-splicing to be efficient, the
substrate and endoribonucleolytic intron RNAs must
base-pair to form helix P1, with a resulting wobble U:G
base-pair. Cleavage of the target RNA occurs at the
phosphodiester bond immediately 3' to (after) the U:G
base-pair. Phylogenetic comparisons and mutational
analyses indicate that the nature of the sequences
immediately adjacent the conserved uracil residue at
the 5' splice site are unimportant for catalysis,
provided the base-pairing of helix P1 is maintained.

Third, the exon sequences flanking the 3' splice
site must be chosen, and adjustments made in the 5'
section of the ribozyme, if necessary, to allow the
formation of a stable P10 helix. While the P10 helix
may be dispensed with if necessary, its presence
enhances splicing and preferred embodiments of the
ribozyme of the invention retain the P10 helix (Suh,
E.R. et al., "Base Pairing Between The 3' Exon And An
Internal Guide Sequence Increases 3' Splice Site
Specificity in the Tetrahymena Self-Splicing rRNA
Intron," Mol. Cell. Biol. 10:2960-2965 (1990)). The
helices P1 and P10 overlap along the T. thermophila
intron IGS, and the 2nd and 3rd residues following both
the 5' and 3' splice sites are complementary to the
same residues in the IGS (Figure 2). While there may
be some advantage in following this, many natural group
I introns do not share this constraint, so the choice
of 3' exon sequences may be determined primarily by
experimental considerations. Such considerations
reflect the wide flexibility in choice of splice sites.
For example, if it is desired to join two sequences at
a given point, the sequence at such point cannot be
mutated or otherwise altered by the trans-splicing
event. Either P1 or P10 can be made shorter if the
overlapping sequences do not otherwise accommodate
the desired splice site.

The sequence requirements for 3' splice-site
selection appear to lie mainly within the structure of
the intron (the ribozyme) itself, including helix P9.0
and the adjoining 3' guanosine residue which delineates
the 3' intron boundary. P9.0 is wholly contained within
the intron sequences and helps define the adjacent 3'
splice site. For the trans-splicing design, the P9.0
helix and the rest of the functional RNA elements
within the intron are not altered. The structural
characteristics of the P9.0 helix are known (Michel, F.
et al., "The Guanosine Binding Site of the Tetrahymena
Ribozyme," Nature 342:391-395 (1989)). However,
flanking sequences within the 3' exon are required for
the formation of helix P10 and efficient splicing, as
shown by mutational analysis.

Fourth, a region of complementary sequence is 
placed at the 5' terminus of the trans-splicing
ribozyme in order to increase its affinity and 
specificity for the target RNA. As shown herein, an 
arbitrary length of around 40 residues has been used. 
Other lengths may be used provided they are not 
detrimental to the desired effect. 
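
The second and fourth guidelines can be sketched together in Python. This is a simplified illustration, not the design procedure of the invention: the function and parameter names are hypothetical, the guide is made fully Watson-Crick except for the G placed opposite the splice-site uracil, the layout (a P1 helix of five target residues ending in the uracil plus three residues past the splice site, then an undesigned spacer, then the extended complementarity) is an assumption drawn from the description above, and the input is expected to be an upper-case RNA string.

```python
COMPLEMENT = {"A": "U", "U": "A", "G": "C", "C": "G"}


def reverse_complement(rna: str) -> str:
    """Return the Watson-Crick reverse complement of an RNA string."""
    return "".join(COMPLEMENT[base] for base in reversed(rna))


def design_five_prime_region(target: str, splice_pos: int, upstream: int = 5,
                             downstream: int = 3, spacer: int = 5,
                             antisense_len: int = 40) -> tuple[str, str]:
    """Return (antisense, guide), both written 5' to 3'.

    splice_pos is the number of target residues 5' of the splice site.
    The spacer residues between the two regions are not designed here.
    """
    if splice_pos < upstream or target[splice_pos - 1] != "U":
        raise ValueError("splice site must directly follow a uracil")
    p1_end = splice_pos + downstream
    if p1_end > len(target):
        raise ValueError("target too short for the requested P1 helix")
    window = target[splice_pos - upstream:p1_end]
    guide = list(reverse_complement(window))
    # The residue opposite the splice-site uracil becomes G (U:G wobble).
    guide[downstream] = "G"
    start = p1_end + spacer
    antisense = reverse_complement(target[start:start + antisense_len])
    return antisense, "".join(guide)


# Toy target: 5' flank ending in U, 3 nt past the site, 5 nt spacer, 10 nt tail.
target = "GGCUCUCUAAACCCCCACGUACGUAC"
antisense, guide = design_five_prime_region(target, splice_pos=8,
                                            antisense_len=10)
print(antisense, guide)
```

In a full design the antisense region would sit at the 5' terminus of the ribozyme, followed by the spacer and the guide sequence, and the spacer and 3' exon would then be adjusted for P10 as described in the third guideline.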

For example, starting with the T. thermophila 
self-splicing intron (diagrammed below) : 

[Diagram: the T. thermophila self-splicing intron, showing helix P1, the ribozyme core, and the splice sites marked 1 and 2.]

(The "1" and "2" in the above diagram (and in other
ribozyme diagrams throughout the application) note the
first and second splice sites, respectively.)

(1) a "5'" site is chosen adjacent to a uracil
residue within a chosen target RNA. The sequences
involved in complementarity do not immediately abut
sequences involved in P1 helix formation but are
separated, for example, by five nucleotides also
involved in P10 formation;

(2) sequences complementary to the chosen RNA are
fused to the 5' portion of the self-splicing Group I
intron. Base-pairing between ribozyme and target RNA
allows formation of the helix P1;

(3) the chosen "3' exon" sequences are fused to
the 3' portion of the ribozyme, maintaining the
conserved helix P10; and

(4) to increase affinity for the target RNA, if
desired, a section of extended sequence complementarity
is fused to the 5' portion of the ribozyme to allow the
formation of 30-40 base-pairs.

The alignment of the resulting trans-splicing 
ribozyme with its target RNA may be diagrammed as shown 
immediately below. The target RNA sequence represents 
the top line. The ribozyme sequence is aligned below 
it, a continuous sequence wrapping around the lower two 
lines wherein the hybridization of the nucleotides at 
the 5' and 3' ends and P1 and P10 of the ribozyme may
be seen.

According to the invention, trans-splicing
ribozymes can be designed that will trans-splice
essentially any RNA sequence onto any RNA target. It
is not necessary that the target contain an intron
sequence or that the ribozyme be an intron in the
target sequence. For example, a strategy for such
design may include (1) the identification of the
desired target RNA, (2) cloning and/or sequencing of the
desired target RNA or portion thereof, (3) selection of
a desired coding sequence to trans-splice into the
target RNA, (4) the construction of a ribozyme of the
invention capable of hybridizing to such target using
the guidelines herein, (5) confirmation that the
ribozyme of the invention will utilize the target as a
substrate for the specific trans-splicing reaction that
is desired and (6) the insertion of the ribozyme into
the desired host cell.

Choice of a target RNA will reflect the desired 
purpose of the trans-splicing reaction. If the purpose 
of the reaction is to inactivate a specific RNA, then 
such RNA must be trans-spliced at a position that 
destroys all functional peptide domains encoded by such 
RNA and at a position that does not result in continued 
expression of the undesired genetic sequences. If more 
than one allele of the gene encoding such RNA exists,
the ribozyme should preferably be designed to 
inactivate the target RNA at a site common to all 
expressed forms. Alternatively, more than one ribozyme 
may be provided to the cell, each designed to 
inactivate a specific allelic form of the target RNA. 

When only inactivation of the target RNA is 
desired, and not the expression of a new, desired RNA 
sequence, it is not necessary that the foreign RNA 
donated by the ribozyme provide a sequence capable of
being translated by the host cell, and a sequence
containing translational stop codons may be used as a
truncated intron, for example, the intron ribozyme
truncated at the ScaI site.

If the purpose of the trans-splicing reaction is
to provide a genetic trait to a host cell, then the
choice of target RNA will reflect the desired
expression pattern of the genetic trait. If it is
desired that the genetic trait be continuously
expressed by the host, then the target RNA should also
be continuously expressed. If it is desired that
the genetic trait be selectively expressed only under 
a desired growth, hormonal, or environmental condition, 
then the target RNA should also be selectively 
expressed under such conditions. 

It is not necessary that expression of the 
ribozyme itself be selectively limited to a desired 
growth, hormonal, or environmental condition if the 
substrate for such ribozyme is not otherwise present in 
the host as the ribozyme itself is not translated by 
the host. Thus, sequences encoded by the RNA donated by
the ribozyme of the invention are not translated in a 
host until the trans-splicing event occurs and such 
event may be controlled by the expression of the 
ribozyme substrate in the host. 

If desired, expression of the ribozyme may be 
engineered to occur in response to the same factors 
that induce expression of a regulated target, or, 
expression of the ribozyme may be engineered to provide 
an additional level of regulation so as to limit the 
occurrence of the trans-splicing event to those 
conditions under which both the ribozyme and target are 
selectively induced in the cell, but by different 
factors, the combination of those factors being the 
undesired event. Such regulation would allow the host 
cell to express the ribozyme's target under those
conditions in which the ribozyme itself was not co-
expressed.

The sequence of the ribozyme domain that
hybridizes to the target RNA is determined by the 
sequence of the target RNA. The sequence of the target 
RNA is determined after cloning sequences encoding such 
RNA or after sequencing a peptide encoded by such 
target and deducing an RNA sequence that would encode 
such a peptide. Cloning techniques known in the art 
may be used for the cloning of a sequence encoding a 
target RNA. 

The selection of a desired sequence to be trans- 
spliced into the target RNA (herein termed the "trans- 
spliced sequence") will reflect the purpose of the 
trans-splicing. If a trans-splicing event is desired 
that does not result in the expression of a new genetic 
sequence, then the trans-spliced sequence need not 
encode a translatable protein sequence. If a trans- 
splicing event is desired that does result in the 
expression of a new genetic sequence, and especially a 
new peptide or protein sequence, then the trans-spliced
sequence may further provide translational stop codons,
and other information necessary for the correct
translational processing of the RNA in the host cell.
If a specific protein product is desired as a result of 
the trans-splicing event then it would be necessary to 
maintain the amino acid reading frame in the resulting 
fusion. 
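
The reading-frame requirement reduces to simple modular arithmetic. The sketch below is a hedged illustration with hypothetical names: given the position of the first nucleotide of the start codon in the target and the chosen splice position, it reports how many nucleotides the 3' exon must supply before its first complete codon so that the fusion stays in frame.

```python
def three_prime_exon_offset(cds_start: int, splice_pos: int) -> int:
    """Nucleotides (0, 1 or 2) the 3' exon must add to complete a codon.

    cds_start  -- 0-based index of the first nucleotide of the start codon
    splice_pos -- number of target nucleotides retained 5' of the splice site
    """
    coding_nt_upstream = splice_pos - cds_start
    if coding_nt_upstream < 0:
        raise ValueError("splice site lies 5' of the coding sequence")
    return (-coding_nt_upstream) % 3


# Nine coding nucleotides retained: the 3' exon may begin with a whole codon.
print(three_prime_exon_offset(cds_start=0, splice_pos=9))   # 0
# Ten retained: one nucleotide of a codon is left over, so two more are needed.
print(three_prime_exon_offset(cds_start=0, splice_pos=10))  # 2
```

A candidate splice site whose offset does not match the available 3' exon sequence can either be rejected or accommodated by adding the required nucleotides at the start of the 3' exon.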

The identification and confirmation of the 
specificity of a ribozyme of the invention is made by 
testing a putative ribozyme's ability to catalyze the
desired trans-splicing reaction only in the presence of
the desired target sequence. The trans-splicing
reaction should not occur if the only RNA sequences
present are non-target sequences to which such ribozyme
should not be responsive (or should be less responsive). Such
characterization may be performed with the assistance
of a marker such that correct (or incorrect) ribozyme
activity may be more easily monitored. In most cases
it is sufficient to test the ribozyme against its 
intended target in vitro and then transform a host cell 
with it for study of its in vivo effects. 

When it is desired to eliminate a host's RNA, such 
elimination should be as complete as possible. When it 
is desired to provide a new genetic sequence to a host 
cell, the trans-splicing reaction of the invention need 
not be complete. It is an advantage of the invention 
that, depending upon the biological activity of the 
peptide that is translated from such genetic sequence, 
the trans-splicing event may in fact be quite 
inefficient, as long as sufficient trans-splicing 
occurs to provide sufficient mRNA and thus encoded 
polypeptide to the host for the desired purpose. 

Transcription of the ribozyme of the invention in 
a host cell occurs after introduction of the ribozyme 
gene into the host cell. If the stable retention of 
the ribozyme by the host cell is not desired, such 
ribozyme may be chemically or enzymatically synthesized 
and provided to the host cell by mechanical methods, 
such as microinjection, liposome-mediated transfection, 
electroporation, or calcium phosphate precipitation. 
Alternatively, when stable retention of the gene 
encoding the ribozyme is desired, such retention may be 
achieved by stably inserting at least one DNA copy of 
the ribozyme into the host's chromosome, or by 
providing a DNA copy of the ribozyme on a plasmid that 
is stably retained by the host cell. 

Preferably the ribozyme of the invention is 
inserted into the host's chromosome as part of an 
expression cassette, such cassette providing 
transcriptional regulatory elements that will control 
the transcription of the ribozyme in the host cell.
Such elements may include, but not necessarily be
limited to, a promoter element, an enhancer or UAS
element, and a transcriptional terminator signal.
Polyadenylation is not necessary as the ribozyme is not
translated. However, such polyadenylation signals may 
be provided in connection with the sequence encoding 
the element to be trans-spliced. 

Expression of a ribozyme whose coding sequence has 
been stably inserted into a host's chromosome is 
controlled by the promoter sequence that is operably 
linked to the ribozyme coding sequences. The promoter 
that directs expression of the ribozyme may be any 
promoter functional in the host cell, prokaryotic 
promoters being desired in prokaryotic cells and 
eukaryotic promoters in eukaryotic cells. A promoter is 
composed of discrete modules that direct the 
transcriptional activation and/or repression of the 
promoter in the host cell. Such modules may be mixed 
and matched in the ribozyme's promoter so as to provide
for the proper expression of the ribozyme in the host.
A eukaryotic promoter may be any promoter functional in
eukaryotic cells, and especially may be any of an RNA 
polymerase I, II or III specificity. If it is desired 
to express the ribozyme in a wide variety of 
eukaryotic host cells, a promoter functional in most 
eukaryotic host cells should be selected, such as a 
rRNA or a tRNA promoter, or the promoter for a widely 
expressed mRNA such as the promoter for an actin gene, 
or a glycolytic gene. If it is desired to express the 
ribozyme only in a certain cell or tissue type, a cell-
specific (or tissue-specific) promoter element
functional only in that cell or tissue type should be 
selected. 

The trans-splicing reaction is chemically the same
whether it is performed in vitro or in vivo. However,
in vivo, since cofactors are usually already present in
the host cell, the presence of the target and the 
ribozyme will suffice to result in trans-splicing. 

The trans-splicing ribozymes and methods of the 
invention are useful in producing a gene activity useful
for the genetic modification, and/or cell death, of 
targeted cells. For example, the trans-splicing 
reaction of the invention is useful to introduce a 
protein with toxic properties into a desired cell. The 
susceptibility of cells will be determined by the 
choice of the target RNA and the regulatory controls 
that dictate expression of the ribozyme. For example, 
a ribozyme that transposes an RNA sequence encoding a 
toxic protein may be engineered so that expression of 
the ribozyme will depend upon the characteristics of an 
operably-linked promoter. In a highly preferred 
embodiment, diphtheria toxin peptide A is encoded by
that part of the ribozyme that is transposed into a
desired target in the host. Conditional expression of
the ribozyme and diphtheria toxin peptide A chain
results in the death of the host cell. Other useful
peptide toxins include ricin, exotoxin A, and herpes
thymidine kinase (Evans, G.A., Genes & Dev. 3:259-263
(1989)). In addition, various lytic enzymes have the
potential for disrupting cellular metabolism. For 
example, a fungal ribonuclease may be used to cause 
male sterility in plants (Mariani, C. et al., Nature 
347:737-741 (1990)). Particular tissues might be 
destroyed due to limited expression of the target RNA. 
Further, if a viral RNA is used as target, new forms of 
virus resistance, or therapies may be engineered. 

A binary system for control of tissue-specific 
gene expression and/or for ectopic ablation may be
designed using the ribozymes of the invention. For
example, lines of Drosophila that express the yeast
transcription activator GAL4 in a tissue- and spatial-
specific pattern using P-element enhancer-trap vectors
may be used. Any transcriptional activator may be used
in place of GAL4 and the invention is not intended to
be limited to GAL4. A gene encoding a fusion ribozyme
that is capable of trans-splicing the DTA sequence may
be placed under the control of the GAL4-UAS promoter
and inserted into Drosophila in a genetically stable
manner. Such ribozyme will not be expressed in
Drosophila in the absence of GAL4. Accordingly,
crossing Drosophila hosts genetically carrying this
ribozyme construct with Drosophila hosts that express
GAL4 in a tissue-specific manner results in progeny that,
when GAL4 expression is induced, exhibit a pattern of
cell death similar to the pattern of GAL4 expression.

In addition, by targeting the ribozyme to trans-
splice with the GAL4 mRNA, the splicing activity of
the ribozyme inactivates GAL4 expression and ribozyme
expression may be self-regulated.

Pro-ribozymes

A trans-splicing ribozyme, as described above, 
consists of three fused sequence elements - a 5' "anti- 
sense" region which is complementary to the target RNA, 
the catalytic region which is based on a self-splicing
Group I intron, and 3' "exon" sequences. The 5' region 
can base pair with the chosen target RNA, to bring it 
into proximity with the catalytic sequences of the 
Group I intron. The structure of the Group I intron 
provides a chemical environment suitable to catalyze 
the precise splicing of the target RNA with the 3' 
"exon" sequences. However, in the absence of the 
appropriate target RNA, the ribozyme sequences can 
still catalyze scission at the 3' "exon" junction 
(similar hydrolysis is seen for Group I self-splicing
introns (Zaug et al., Science 231:470-475 (1986))), and
may be able to catalyze illegitimate splicing events
through transient base-pairing of the ribozyme with
heterologous RNA sequences (which may include their
own). Such side-reactions and illegitimate splicing 
events are unwanted, and may be deleterious. For 
example, if trans-splicing is to be used for 
conditional delivery of a toxin in vivo, illegitimate 
trans-splicing might result in unexpected expression of 
the toxic activity. Spontaneous cleavage at the 3' 
"exon" junction would lower the efficiency of trans- 
splicing. 

To help avoid these problems, "pro-ribozyme" forms 
of the trans-splicing RNAs have been constructed 
wherein, for example, helix P8 is disrupted. The pro-
ribozymes are constructed to contain extra self-
complementary sequences which cause the catalytic
center of the ribozyme to be mis-folded. The pro-
ribozymes are inactive in the absence of the intended
target RNA; active forms are only formed after base-
pairing of the ribozyme and target RNAs, with
consequent displacement of the interfering secondary 
structure within the ribozyme. Pro-ribozymes are 
intended to be catalytically inert species in the 
absence of the target RNA, to eliminate unwanted self- 
cleavage, self-splicing and illegitimate trans-splicing 
reactions in vitro and in vivo (Figure 12). 

The pro-ribozymes described here are conformationally
disrupted and therefore inactive forms of the
trans-splicing activities. Thus the pro-ribozymes
possess little self-cleavage activity. They are only
re-activated by specific interaction with the target
RNA, and thus are substrate-activated ribozymes which
are less likely to catalyze trans-splicing to an
unintended target RNA. Trans-splicing ribozymes are
intended to be used for the delivery of new gene
activities in vivo, and any reduction in the extent of
unwanted side reactions or illegitimate splicing is
desirable, and may be necessary.

While the disruption of helix P8 has been
exemplified here for the trans-splicing pro-ribozymes,
other helices which are required for catalytic activity
could also have been used.

The same approach, of disrupting the conformation
of a catalytically important structure in such a way
that only base-pairing with the intended substrate RNA
will allow the formation of an active ribozyme, could
be applied to other ribozyme designs. For example, the
loop sequence of a "hammerhead" type endoribonuclease
(Haseloff et al., Nature 334:585-591 (1988)) could be
extended and made complementary to one of the "anti-
sense" arms of the ribozyme, similar to the above
modification of helix P8. Endoribonuclease activity
would only be exhibited after base-pairing with the
chosen target RNA, displacement of the disrupting
secondary structure, and reformation of the stem-loop
structure required for catalysis. This would
effectively increase the specificity of the ribozyme for
its target.

In addition, the activation of a pro-ribozyme need
not rely on base-pairing with the substrate itself.
Instead, a chosen third RNA or ssDNA or even protein
might be required for activity. An additional base-
pairing or RNA-protein interaction would be required
for the formation of an active ribozyme complex. The
availability of such additional components would
determine ribozyme activity, and could be used to alter
ribozyme selectivity.

The ribozyme or pro-ribozyme of the invention may
be introduced into any host cell, prokaryotic or
eukaryotic and especially into a plant or mammalian
host cell, and especially a human cell, either in
culture or in vivo, using techniques known in the art
appropriate to such hosts. The ribozymes of the
invention may also be engineered to destroy viruses. 
In one embodiment, the ribozyme or pro-ribozyme of the 
invention is provided in a genetically stable manner to 
a host cell prior to a viral attack. Infection by the 
appropriate virus, or expression of the latent virus in 
such host cell (resulting in the appearance of the
ribozyme's or pro-ribozyme's target RNA in the host
cell), would stimulate the catalytic activity of the
ribozyme and destruction of the viral RNA target and/or
production of a toxin via trans-splicing resulting in
death of the virus infected cells. In another 
embodiment, the ribozyme or pro-ribozyme may be 
engineered and packaged into the virus itself. Such 
embodiments would be especially useful in the design of 
viruses for investigative purposes, wherein the 
ribozyme or pro-ribozyme may be designed to destroy the 
function of a specific viral RNA and thus allow the 
study of viral function in the absence of such RNA. 
Viruses carrying ribozymes may also be used as carriers
to transfect host cells with a desired ribozyme or pro- 
ribozyme activity. 

Male or female sterility may be engineered in 
agronomically important species using the ribozymes or 
pro-ribozymes of the invention. For example, male 
sterility in tobacco may be engineered by targeting
TA29 or TA13 mRNA (tobacco anther-specific genes;
Seurinck, J. et al., Nucl. Acids Res. 18:3403 (1990))
with a ribozyme or pro-ribozyme of the invention that
trans-splices the DTA 3' exon into those targets.

The form of crop plants may be manipulated by 
selective destruction or modification of tissues using 
the ribozymes or pro-ribozymes of the invention. For 
example, seedless fruits may be made by targeting the
seed storage protein mRNA with a ribozyme or pro-
ribozyme of the invention that trans-splices the DTA 3'
exon into the target.

Transgenic plants may be protected against 
infection by expression of virus-specific ribozymes or
pro-ribozymes to kill infected cells. This would be an
artificial form of the "hypersensitive response." For
example, cucumber mosaic virus coat protein mRNA may be
targeted with a ribozyme or pro-ribozyme of the
invention that trans-splices the DTA 3' exon into the
target. 

Populations of micro-organisms may be made 
resistant to specific pathogens by introduction of 
trans-splicing ribozymes or pro-ribozymes. For 
example, cheese-making bacteria may be made resistant 
to phage infection by targeting the phage RNA with a
bacterial toxin gene or lytic enzyme encoded by the 3'
exon provided by the ribozyme or pro-ribozyme of the 
invention, for example, which would interfere with 
phage replication by causing premature lysis after 
phage infection.

Virus pathogens could be constructed to deliver
toxic activities via trans-splicing. In this way,
specific cell types could be targeted for ablation,
such as for cancer or viral therapy. For example, HIV
mRNA may be targeted by a ribozyme or pro-ribozyme of
the invention that carries the DTA 3' exon, for either
virus or liposome delivery.

The examples below are for illustrative purposes 
only and are not deemed to limit the scope of the 
invention.

EXAMPLES

Example 1

Construction and Characterization of a CAT-LacZ
Trans-Splicing Ribozyme

I. PCR Amplification and Cloning of the Ribozyme of
the Invention

Following the guidelines outlined above, a trans-
splicing fusion ribozyme was designed that will splice
a portion of the amino-terminal coding sequence of E.
coli β-galactosidase (LacZ) mRNA to a site in the
chloramphenicol acetyl transferase (CAT) mRNA (Figure
3). The sections of new sequence flanking the T.
thermophila ribozyme core and the 3' exon were
synthesized as oligonucleotides. The intact ribozyme
sequence was then assembled by successive polymerase
chain reactions, using the synthetic adaptor
oligonucleotides as primers with ribozyme and β-
galactosidase DNA templates (while there are other
methods available, this method is most convenient).

For the construction of a ribozyme capable of
splicing β-galactosidase (LacZ) α-peptide coding
sequence to a site in the 5' coding sequence of the
chloramphenicol acetyl transferase (CAT), three
oligonucleotides were synthesized.

Oligonucleotide 1 

5'-GGCCA AGCTT CTTTA CGATG CCATT GGGAT ATATC AACGG
TGGTA TAAAC CCGTG GTTTT TAAAA GTTAT CAGGC ATGCA CC-3'
[SEQ ID NO. 2]

Oligonucleotide 2 

5'-GATTA GTTTT GGAGT ACTCG TACGG ATTCA CGGCC GTCGT
TTTAC AA-3' [SEQ ID NO. 3]

Oligonucleotide 3

5'-GGCCG AATTC TTACA ATTTC CATTC AGGCT GCGCA ACTGT TGG-3'
[SEQ ID NO. 4]

Oligonucleotides 2 and 3 (200 pmoles each) were
combined with 0.1 µg PvuII-cut pGEM4 DNA (which
contained the LacZ α-peptide sequence), and subjected
to PCR amplification in a volume of 100 µl containing:

50 mM KCl,
10 mM Tris-HCl pH 8.3,
1.5 mM MgCl2,
0.4 mM dNTPs,
0.1% gelatin, and
5 U Taq DNA polymerase,

and incubated for 30 cycles, 1 min @ 94°C, 2 mins @
50°C, 2 mins @ 72°C.
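
As a small worked calculation from the figures above, the following Python sketch derives the primer concentration and the total programmed cycling time. The step labels (denature, anneal, extend) are an interpretation of the three temperatures rather than wording from the text, the names are illustrative, and ramp times between temperatures are ignored.

```python
# (label, temperature in degrees C, minutes) for one cycle, from the text.
PCR_STEPS = [("denature", 94, 1), ("anneal", 50, 2), ("extend", 72, 2)]
CYCLES = 30
REACTION_VOLUME_UL = 100
PRIMER_PMOL = 200  # amount of each oligonucleotide


def total_cycling_minutes(steps, cycles: int) -> int:
    """Programmed hold time only; temperature ramping is not included."""
    return cycles * sum(minutes for _, _, minutes in steps)


def primer_micromolar(pmol: float, volume_ul: float) -> float:
    """pmol per microlitre is numerically equal to micromolar."""
    return pmol / volume_ul


print(total_cycling_minutes(PCR_STEPS, CYCLES))            # 150
print(primer_micromolar(PRIMER_PMOL, REACTION_VOLUME_UL))  # 2.0
```

The run therefore holds temperature for 150 minutes in total, with each primer present at 2 µM.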

Plasmid pGEM4 is commercially available from 
Promega Corporation, Madison, WI, USA.

The amplified product of 210 base-pairs was
purified using low-gelling temperature agarose
electrophoresis, and was used as primer in a second
round of PCR amplification.

For the second round of PCR amplification,
2.0 µg of 210 base-pair amplified product, 200 pmoles
oligonucleotide 1 and 0.1 µg 450 base-pair fragment
containing the T. thermophila IVS were mixed and
subjected to PCR amplification using the conditions
shown above. The resulting 660 base-pair product was
digested with the restriction endonucleases EcoRI and
HindIII, and cloned into the plasmid vector pGEM4. The
complete sequence of the CAT-LacZ α-peptide ribozyme
DNA sequence is presented as SEQ ID NO. 5 and Figure
3B.
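
The cloning step relies on restriction sites carried by the outer primers. The following Python sketch, offered as an illustrative check rather than part of the described method, scans the three oligonucleotides as printed above for the HindIII, EcoRI and ScaI recognition sequences. The dictionary and function names are assumptions; the recognition sequences are the standard ones for these enzymes.

```python
def clean(seq: str) -> str:
    """Remove the spacing used in the printed oligonucleotide sequences."""
    return "".join(seq.split())


OLIGOS = {
    "oligo 1": clean("GGCCA AGCTT CTTTA CGATG CCATT GGGAT ATATC AACGG "
                     "TGGTA TAAAC CCGTG GTTTT TAAAA GTTAT CAGGC ATGCA CC"),
    "oligo 2": clean("GATTA GTTTT GGAGT ACTCG TACGG ATTCA CGGCC GTCGT "
                     "TTTAC AA"),
    "oligo 3": clean("GGCCG AATTC TTACA ATTTC CATTC AGGCT GCGCA ACTGT TGG"),
}

SITES = {"HindIII": "AAGCTT", "EcoRI": "GAATTC", "ScaI": "AGTACT"}


def sites_in(oligo: str) -> list[str]:
    """Names of the enzymes whose recognition sequence occurs in the oligo."""
    return [name for name, site in SITES.items() if site in oligo]


for name, seq in OLIGOS.items():
    print(name, len(seq), sites_in(seq))
```

Oligonucleotide 1 carries the HindIII site and oligonucleotide 3 the EcoRI site, consistent with digestion of the final product by these two enzymes, while oligonucleotide 2 spans the ScaI site at the 3' end of the intron.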

The cloning vector containing the cloned sequences
was transformed into, and propagated in, the bacterial
host XL1-Blue (Stratagene, La Jolla, California), using
techniques known in the art (Maniatis, Molecular
Cloning, A Laboratory Guide, 2nd edition, 1989, Cold
Spring Harbor Laboratory, Publishers). However, any
bacterial host capable of stably maintaining the vector
may be used, for example JM109.

The plasmid may be extracted from the host cell 
for further analysis using techniques commonly known in 
the art (Maniatis, Molecular Cloning, A Laboratory 
Guide, 2nd edition, 1989, Cold Spring Harbor 
Laboratory, Publishers) . 

II. In vitro Transcription of Cloned Ribozyme and
Target RNAs

Using standard procedures, cloned sequences were
purified from the bacterial host and the plasmid
linearized using a restriction endonuclease that does
not cut the ribozyme sequence (for example, EcoRI),
and transcribed using T7 RNA polymerase in a volume of
100 µl, containing:

40 mM Tris-HCl pH 7.5,
6 mM MgCl2,
2 mM spermidine,
10 mM NaCl,
10 mM DTT,
1 mM NTPs (containing 20 µCi [α-32P]UTP, if
labelled RNA transcripts were desired),
100 U RNasin, and
50 U T7 RNA polymerase,

and the reaction was incubated at 37°C for 2 hours.

RNA transcripts were purified by 5% polyacrylamide
gel electrophoresis before use (TBE, 7M urea gel).
RNAs containing active T. thermophila IVS sequences
undergo some spontaneous scission at the 3' intron-exon
junction during transcription. Fragments are removed
by electrophoretic purification for clarity of analysis
during subsequent trans-splicing assays.

III. In Vitro Trans-splicing Reaction Conditions

Target and/or trans-splicing ribozymes are
incubated under the following conditions:

0.1-0.5 µg RNA component (amount depends on type of
experiment, usually ribozyme in 5-fold excess of
target),
30 mM Tris-HCl pH 7.5,
100 mM NaCl,
2 mM GTP,
5 mM MgCl2,

in a volume of 5 µl at 42°C for 60 min.

The reaction is diluted with 95 µl 0.1 mM Na2EDTA,
200 mM NaCl, and ethanol precipitated. The RNAs are then
analysed on 5% polyacrylamide gels containing TBE
buffer, 7M urea and 25% formamide, and
autoradiographed.
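
The composition of the mixture after the dilution step follows from a simple volume-weighted average. The sketch below is a worked calculation only, assuming the diluent contains nothing besides the Na2EDTA and NaCl named in the text; the function and dictionary names are illustrative.

```python
REACTION_UL = 5.0   # trans-splicing reaction volume from the text
DILUENT_UL = 95.0   # volume of 0.1 mM Na2EDTA, 200 mM NaCl added

# component: (mM in the reaction, mM in the diluent)
COMPONENTS_MM = {
    "Tris-HCl": (30.0, 0.0),
    "NaCl": (100.0, 200.0),
    "GTP": (2.0, 0.0),
    "MgCl2": (5.0, 0.0),
    "Na2EDTA": (0.0, 0.1),
}


def mixed_conc(conc_a, vol_a, conc_b, vol_b):
    """Concentration after combining two solutions (volume-weighted mean)."""
    return (conc_a * vol_a + conc_b * vol_b) / (vol_a + vol_b)


def final_concentrations(components=COMPONENTS_MM):
    """Final mM of each component in the 100 µl diluted sample."""
    return {name: mixed_conc(rxn, REACTION_UL, dil, DILUENT_UL)
            for name, (rxn, dil) in components.items()}


if __name__ == "__main__":
    for name, conc in final_concentrations().items():
        print(f"{name}: {conc:.3f} mM")
```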

IV. Assay of Endonucleolytic Activity

After base-pairing of the ribozyme and target, the
first step in trans-splicing is the guanosine-mediated
cleavage of the target RNA at the intended 5' splice
site. Annealing and trans-splicing may be performed in
a buffer such as 30 mM Tris-HCl, pH 7.5, 100 mM NaCl,
5 mM MgCl2, 2 mM GTP at 42°C. As the 3' splice site is
dispensable for this reaction, truncated trans-splicing
ribozymes should behave as highly-specific
endoribonucleases. To test this activity, shortened in
vitro transcripts of the CAT-LacZ α-peptide trans-
splicing ribozyme described above (SEQ ID NO. 5 and
Figure 3) were incubated with CAT mRNA sequences. The
CAT-LacZ ribozyme cassette is on a HindIII-EcoRI
fragment. The ScaI cleavage site marks a position 5
bases upstream of the 3' splice site. The ribozyme
specifically cleaved the target RNA at the expected
single site to produce the expected size fragments.

V. The Trans-splicing Reaction

To confirm the ability of the CAT-LacZ α-peptide
ribozyme to catalyze the ligation of 3' exon sequences
at the 5' splice site, various forms were incubated
with radiolabeled CAT RNA. Ribozyme transcripts were
synthesized from DNA templates which had been 3'
truncated at one of several positions, ranging from the
end of the ribozyme core through the exon sequence.
Incubation with labelled CAT led to the formation of
the expected spliced products, which differed in length
depending on the extent of 3' exon sequence.

In addition, a certain proportion of the CAT-LacZ
α-peptide ribozyme molecules underwent spontaneous
cleavage at the 3' splice site during in vitro
transcription, similar to the intact T. thermophila
intron. These cleaved forms, terminated at the
guanosine residue adjacent to the 3' splice site, were
also incubated with CAT RNA. In this case, the ribozyme
itself is ligated to a 3' portion of the CAT RNA, to
produce a product of about 550 nucleotides in size.
This reaction is similar to the self-circularization of
the intact intron, and the same ligation product is
found in the other trans-splicing reactions.

VI. Accuracy of the Trans-splicing

The products from a CAT-LacZ α-peptide trans-
splicing reaction were reverse-transcribed, and
amplified by polymerase chain reaction using two
oligonucleotides complementary to sequences on either
side of the predicted splice sites. Amplified
sequences were cloned and sequenced. Individual
recombinants showed no variation from the expected
sequence of the spliced products. As found in studies
with the intact intron, splicing appears to be highly
accurate.

Accordingly, the studies above show that a trans- 
splicing ribozyme designed according to the guidelines 
of the invention is capable of accurate, effective 
trans-splicing in vitro. 

Design of a Trans-Splicing Ribozyme that Provides
Plant Virus Resistance

Cucumber mosaic virus (CMV) is a pandemic virus
with a large number of known strains. Nine sequenced
strains are shown in the region of the start of their
coat protein cistron encoded in RNA 3 and the
subgenomic mRNA 4 (SEQ ID NOS. 7-25; Figures 4(A) and
5). Two sites have been chosen which are conserved in
sequence and downstream from the AUG start codon of the
coat protein. Oligonucleotides for the construction of
ribozymes capable of trans-splicing the ile-mutant form
of DTA into the CMV coat protein mRNA are shown in
Figure 4B and are discussed below.

The trans-splicing ribozymes shown in Figure 4C
and D are targeted to the CMV virus sequences shown in
Figure 4B and will result not only in the cleavage of
the CMV RNA molecules but in the expression of
diphtheria toxin A-chain in the infected cell. The
trans-splicing cassettes shown in Figure 4 may be
transformed into any CMV-susceptible plant species
using techniques known in the art, and transgenic
progeny challenged by CMV infection. The design of the
ribozyme is such that virus infection is necessary to
initiate toxin production via RNA trans-splicing
because the ribozyme itself is not translated. The
localized death of the infected cells that results from
expression of the toxin could limit replication and
spread of the virus within the plant, giving an
artificial hypersensitive response.



Example 3 

Construction and Characterization of a Gal4-
Diphtheria Toxin A Chain Trans-Splicing Ribozyme

According to the invention and the methods
described in Example 1, a fusion ribozyme has been
designed that is a Gal4-Diphtheria toxin A chain trans-
splicing ribozyme (Figure 6). The sequence of this
ribozyme is shown as SEQ ID NO. 6. The GAL4-DTA
ribozyme cassette is a SalI-XhoI fragment. The ScaI
site marks a position 5 bases upstream of the 3' splice
site. This ribozyme is capable of splicing the coding
sequence for the A chain of the diphtheria toxin to a
site in the 5' region of the GAL4 mRNA. This trans-
splicing activity is seen both in vitro (as above)
and in vivo (below). The major criteria for successful
design of the GAL4-DTA ribozyme, and any trans-splicing
ribozyme that trans-splices a sequence encoding a toxic
product, are not only the efficient and precise
catalysis of trans-splicing, but also that expression
of the toxic product, for example DTA, does not occur
in the absence of trans-splicing.

The catalytic portion of the ribozyme is
constructed according to the design outlined above, and
5' and 3' splice sites chosen within the 5' coding
regions of GAL4 and DTA, respectively. The 3' exon
sequence corresponds to that of a DTA gene already used
for expression in eukaryotes, except for the removal of
the first AUG codon and several proximal amino acids.
The original C. diphtheriae form of DTA also differs in
this 5' region, utilizing a CUG codon for translation
initiation. The original DTA sequence also contains a
signal peptide leader sequence which is absent here.

These ribozyme molecules can undergo spontaneous
scission at the 3' splice site. Given the extreme
toxicity of DTA, it is important that any liberated 3'
exon sequences not give rise to toxic translation
products. The 3' exon contained an in-frame methionine
at position 13, which could conceivably give rise to a
truncated but toxic polypeptide. To eliminate this
possibility, the wild-type sequence (Rz-DTA met) was
altered from methionine at this position to isoleucine
(Rz-DTA ile) or leucine (Rz-DTA leu) in two separate
ribozyme constructions (Figure 6).
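
The check for in-frame initiation codons described above can be sketched as follows. The exon used here is a hypothetical 15-codon stand-in, not the DTA sequence, and the replacement codons (ATT for isoleucine, CTG for leucine) are illustrative choices; the source does not state which codons were used.

```python
# HYPOTHETICAL 3' exon, written as DNA codons; codon 13 is ATG.
# This is a stand-in for illustration, not the DTA coding sequence.
EXAMPLE_EXON = "".join([
    "GCT", "GAT", "GAC", "GTT", "GTC", "GAT", "TCT", "TCC",
    "AAA", "TCT", "TTT", "GTG", "ATG", "GAA", "AAC",
])


def in_frame_start_codons(exon, frame_offset=0):
    """Return 1-based codon numbers of ATG codons in the given reading frame."""
    codons = [exon[i:i + 3] for i in range(frame_offset, len(exon) - 2, 3)]
    return [n for n, codon in enumerate(codons, start=1) if codon == "ATG"]


def replace_codon(exon, codon_number, new_codon, frame_offset=0):
    """Return the exon with one codon (1-based) replaced by new_codon."""
    if len(new_codon) != 3:
        raise ValueError("a codon has exactly three bases")
    start = frame_offset + 3 * (codon_number - 1)
    return exon[:start] + new_codon + exon[start + 3:]


if __name__ == "__main__":
    print("in-frame ATG codons:", in_frame_start_codons(EXAMPLE_EXON))
    ile_form = replace_codon(EXAMPLE_EXON, 13, "ATT")  # Met -> Ile (assumed codon)
    leu_form = replace_codon(EXAMPLE_EXON, 13, "CTG")  # Met -> Leu (assumed codon)
    print("after Ile substitution:", in_frame_start_codons(ile_form))
    print("after Leu substitution:", in_frame_start_codons(leu_form))
```

Only in-frame codons are reported: the example exon also contains an out-of-frame ATG (spanning codons 2 and 3), which the scan correctly ignores.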

In Vivo Activity of the Ribozymes of the Invention

I. Introduction 

The in vivo activity of a ribozyme designed
according to the guidelines provided herein, and the
ability of such ribozymes to deliver new gene
activities to host cells, was demonstrated using the
Gal4-Diphtheria toxin A chain trans-splicing ribozyme
described in Example 3 and in Figure 6 to deliver the
highly toxic diphtheria toxin A product to a host cell.
In this system, Drosophila was the chosen host and it
was desired to control expression of the ribozyme of
the invention in a tissue-specific manner within the
Drosophila host.

Diphtheria toxin is secreted by Corynebacterium
diphtheriae lysogenic for β phage. The toxin is
produced as a single polypeptide which undergoes
proteolysis to produce A and B chains. The A chain
(DTA) contains a potent ADP ribosylase activity which
is specific for the eukaryote translation elongation
factor EF-2. The presence of even a few molecules of
this enzyme is enough to cause cessation of translation
and eventual death in a variety of eukaryote cells.
The B chain allows intracellular delivery: the toxin
attaches to cell surface receptors by binding
mannose residues, is endocytosed, and enters the
cytoplasm by vesicular fusion.

In the absence of the B-chain, the A-chain is much
less toxic when present extracellularly. This
property, and its extreme toxicity, have suggested its
use for ectopic ablation experiments. For example,
sequences encoding DTA have been expressed in
transgenic mice, using an opsin promoter to drive
expression in developing eyes. The resulting mice are
blind, with deformed eyes (Breitman, M.L., Science
238:1563-1565 (1987)). In other studies, ablation of
the mouse pancreas was performed (Palmiter, R.D. et
al., Cell 50:435-443 (1987)), and Wert, S.E. et al., Am.
Rev. Respir. Dis. 141 (no. 4, part 2):A695 (1990),
described ablation of alveolar cells by use of a
chimeric gene consisting of the promoter and 5'
flanking sequence of the human surfactant protein C
gene (expressed in type II alveolar cells) and the DTA
gene.

However, using this type of approach, it is not
possible to maintain or propagate transformed organisms
which might have more severe, or lethal, phenotypes. In
addition, transformation of certain species, such as
Drosophila, with intact DTA sequences has not been
reported to date. Leaky expression of the DTA gene
during such transformations leads to immediate death.

II. The Drosophila System

A general method for targeting gene expression in
Drosophila has been developed. First, the system
allows the rapid generation of individual strains in
which ectopic gene expression can be directed to
different tissues or cell types: the enhancer detector
technique is utilized (O'Kane, C.J. and Gehring, W.J.,
Proc. Natl. Acad. Sci. USA 84:9123-9127 (1987); Bellen et
al., Genes and Development 3:1288-1300 (1989); Bier et
al., Genes and Development 3:1273-1287 (1989)) to
express a transcriptional activator protein in a wide
variety of patterns in embryos, in larvae and in
adults. Second, the method separates the activator
from its target gene in distinct lines, to ensure that
the individual parent lines are viable: in one line
the activator protein is present but has no target gene
to activate, in the second line the target gene is
silent. When the two lines are crossed, the target
gene is turned on only in the progeny of the cross,
allowing dominant phenotypes (including lethality) to
be conveniently studied.
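
The logic of separating activator and target into two parent lines can be illustrated with a small enumeration of a cross. This is a minimal sketch under stated assumptions: each parent is taken to carry a single copy of its insertion, the two insertions are assumed to assort independently, and the names `Gal4`, `UAS-X`, `driver` and `target` are labels chosen for the example.

```python
from fractions import Fraction
from itertools import product

# ASSUMPTION: each parent line carries one copy of its insertion ("+" = no insert).
GAL4_LINE = {"driver": ("Gal4", "+"), "target": ("+", "+")}
UAS_LINE = {"driver": ("+", "+"), "target": ("UAS-X", "+")}


def gametes(parent):
    """All allele combinations, one allele per locus (independent assortment)."""
    loci = sorted(parent)
    return [dict(zip(loci, combo))
            for combo in product(*(parent[locus] for locus in loci))]


def expresses_target(genotype):
    """The target is on only when activator and UAS-linked gene are both present."""
    return "Gal4" in genotype["driver"] and "UAS-X" in genotype["target"]


def fraction_expressing(parent_a, parent_b):
    """Fraction of progeny of the cross in which the target gene is activated."""
    offspring = [
        {locus: (ga[locus], gb[locus]) for locus in ga}
        for ga, gb in product(gametes(parent_a), gametes(parent_b))
    ]
    return Fraction(sum(expresses_target(o) for o in offspring), len(offspring))


if __name__ == "__main__":
    print("Gal4 parent expresses target:", expresses_target(GAL4_LINE))
    print("UAS parent expresses target:", expresses_target(UAS_LINE))
    print("fraction of progeny expressing:", fraction_expressing(GAL4_LINE, UAS_LINE))
```

Neither parent line expresses the target on its own, which is why lines carrying an otherwise lethal target gene can be maintained.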

To ectopically express only the gene of interest,
a transcriptional activator that has no endogenous
targets in flies is required. An activator from yeast,
Gal4, can activate transcription in flies but only from
promoters that bear Gal4 binding sites (Fischer et al.,
Nature 332:853-865 (1988)). To target gene expression,
Gal4 is restricted to particular cells in two ways:
either Gal4 transcription is driven by characterized
fly promoters, or an enhancerless Gal4 gene is randomly
integrated in the Drosophila genome, bringing it under
the control of a diverse array of genomic enhancers.
To assay transactivation by Gal4, flies that express
Gal4 are crossed to those bearing a lacZ gene whose
transcription is driven by Gal4 binding sites (Fischer
et al., Nature 332:853-865 (1988)). β-galactosidase is
expressed only in those cells in which Gal4 is first
expressed. Tissue- and cell-specific transactivation
of lacZ has been demonstrated in strains in which Gal4
is expressed and in which a variety of patterns are
established.

With this system, it is now possible: 1) to place 
Gal4 binding sites upstream of any coding sequence; 2) 
to activate that gene only within cells where Gal4 is 
expressed and 3) to observe the effect of this aberrant 
expression on development. In cases where ectopic 
expression is lethal, this method allows the two parent 
lines (one expressing Gal4, the other carrying a silent 
gene bearing Gal4 binding sites in its promoter) to be 
stably propagated. Phenotypes can then be studied in 
the progeny of a cross. 

III. Vectors

The vectors utilized as starting materials in 
these studies include: 

1) pGATB and pGATN (Figure 9): These vectors
are used for cloning promoters and enhancers upstream
of a promoterless Gal4 gene.

Vectors were constructed in which either a unique
NotI or BamHI site is inserted upstream of the Gal4
coding region. Once a promoter has been linked to the
Gal4 coding sequence, the gene can be excised from the
pHSREM vector backbone (Knipple and Marsella-Herrick,
Nucl. Acids Res. 16:7748 (1988)) and moved into a P-
element vector. The Rh2 promoter has been cloned
(Mismer et al., Genetics 120:173-180 (1988)) into this
vector and flies have been generated in which Gal4 is
expressed only in the ocelli.

2) pGawB: This is a Gal4 vector for use in
enhancer detection.

An enhancerless Gal4 gene was subcloned into the
vector plwB (Wilson et al., Genes and Development
3:1301-1313 (1989)) to create pGawB. plwB was first
digested with HindIII to remove the lacZ gene and the
N-terminus of the P-transposase gene. These were
replaced with the entire Gal4 coding region behind the
TATA box of the P-transposase gene.

3) pUAST (Figure 10): This plasmid was used for
cloning coding sequences downstream of the Gal UAS.

A vector into which genes can be subcloned behind
the Gal4 UAS (Upstream Activation Sequence) was
constructed in the P-element vector, pCaSpeR3 (C.
Thummel, Univ. of Utah Medical Center, Salt Lake City,
Utah, personal communication). Five Gal4 binding sites
were inserted, followed by the hsp70 TATA box and
transcriptional start, a polylinker, and the SV40
intron and polyadenylation site. Unique sites into
which genes, or cDNAs, can be inserted include: EcoRI,
BglII, NotI, XhoI, KpnI and XbaI.

IV. Drosophila Strains 

The genetic techniques described herein used to
characterize the strains of Drosophila utilized in
these studies are well known in the art ("Genetic
Variations of Drosophila melanogaster," D. Lindsley and
E.H. Grell, eds.).

The P-element transposons are mobilized using the
"jumpstarter" strain that carries Δ2-3, a defective
P-element on the third chromosome that expresses high
levels of a constitutively active transposase
(Robertson et al., Genetics 118:461-470 (1988)). The
three stocks currently used to generate and map the
insertion lines were deposited in the Drosophila Stock
Center, Indiana University Department of Biology,
Jordan Hall A 503, Bloomington, Indiana 47405:

1: y w; +/+; Sb P[ry+ Δ2-3]/TM6, Ubx
2: w; +/+; TM3, Sb/CxD (deposit no. 3665)
3: w; CyO/Sco; +/+ (deposit no. 3666)

where the genetic characteristics of the three
chromosomes are separated by semicolons. Thus, for
example, in strain 1, the first chromosome (the X
chromosome) is homozygous for yellow and white ("y w"),
the second chromosome is wild-type ("+/+"), and the
third chromosome carries the stubble gene ("Sb"), the
P-element transposon rosy gene ("ry+") and Δ2-3,
while the second third chromosome carries balancer
inversions ("/TM6, Ubx").
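
The genotype notation explained above (chromosomes separated by semicolons, homologs separated by a slash, a single entry meaning homozygous) can be parsed mechanically. The following is a minimal sketch; the function name and the returned structure are assumptions made for illustration, and the transposase element is spelled `Delta2-3` in the code purely to keep the example ASCII-only.

```python
CHROMOSOMES = ("X", "2", "3")  # order used in the stock descriptions


def parse_genotype(text):
    """Split a three-chromosome genotype string into pairs of homologs.

    Chromosomes are separated by ';' and homologs by '/'. A chromosome
    written without '/' is treated as homozygous, as the text describes
    for "y w" in strain 1.
    """
    parts = [part.strip() for part in text.split(";")]
    if len(parts) != len(CHROMOSOMES):
        raise ValueError(f"expected {len(CHROMOSOMES)} chromosomes, got {len(parts)}")
    genotype = {}
    for name, part in zip(CHROMOSOMES, parts):
        homologs = [h.strip() for h in part.split("/")]
        if len(homologs) == 1:
            homologs = homologs * 2  # homozygous shorthand
        if len(homologs) != 2:
            raise ValueError(f"chromosome {name}: expected two homologs")
        genotype[name] = tuple(homologs)
    return genotype


if __name__ == "__main__":
    stocks = {
        1: "y w; +/+; Sb P[ry+ Delta2-3]/TM6, Ubx",
        2: "w; +/+; TM3, Sb/CxD",
        3: "w; CyO/Sco; +/+",
    }
    for number, text in stocks.items():
        print(number, parse_genotype(text))
```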

V. Strategy for Generating Gal4 Expression Patterns

A. Scheme used to isolate transformants
Constructs are injected into embryos derived from the stock:
♀♀ y w/y w; Δ2-3,Sb/TM6,Ubx X ♂♂ y w/Y; Δ2-3,Sb/TM6,Ubx

G0: Establish single lines

♀ y w/y w; Δ2-3,Sb/TM6,Ubx X ♂ y w/Y; +/+

or

♂ y w/Y; Δ2-3,Sb/TM6,Ubx X ♀ y w/y w; +/+

F1: Select [w+] and [Sb+] progeny and establish stocks.
♀ y w/y w; +/TM6,Ubx X ♂ y w/Y; +/+

or

♂ y w/Y; +/TM6,Ubx X ♀ y w/y w; +/+

B. Schemes used to jump the enhancerless Gal4 insert
1 - Jumps from the X-chromosome

♀♀ FM3/FM7,w; +/+ X ♂♂ y w/Y; Δ2-3,Sb/TM6,Ubx

♀♀ FM7,w/P[Gal4,w+] X ♂♂ FM7/Y; Δ2-3,Sb/+

♀♀ FM7,w/P[Gal4,w+]; Δ2-3,Sb/+ X ♂♂ y w/Y; +/+

♀ FM7,w/y w; Δ2-3,Sb/+ X ♂ y w/Y; +/+

Select [w+] and [B] progeny and establish stocks

2 - Jumps from the Δ2-3 chromosome

♀♀ y w/y w X ♂♂ y w/Y; P[Gal4,w+], Δ2-3,Sb/+
Select [w+] and [Sb+] progeny and establish stocks.

C. Chromosomal segregation 

To analyze the segregation of the insertions, two stocks are used:
w; +/+; TM3, Sb/CxD and w; CyO/Sco; +/+.

Method 

To create a large number of strains that express
Gal4 in a cell- or tissue-specific manner, enhancer
detection vectors have been built that carry different
versions of the Gal4 gene. Two genes, encoding either
the full-length protein or a truncated protein, have
been cloned into rosy (ry+) and white (w+) P-element
vectors (modified versions of plArB and plwB; Wilson et
al., Genes and Development 3:1301-1313 (1989)). Using
ry+ or w+ as a screen, these vectors have been
mobilized by introduction of the Δ2-3 gene (Robertson
et al., Genetics 118:461-470 (1988)). To visualize the
expression pattern of Gal4, the Gal4 insertion lines
are crossed to a strain that carries the lacZ gene
under the control of the Gal4 UAS (Fischer et al.,
Nature 332:853-865 (1988)). Embryos, larvae and adults
derived from these crosses are screened for β-
galactosidase expression either by an enzyme assay,
with X-gal as a substrate, or by staining with
monoclonal antibodies against β-galactosidase. β-
galactosidase encoded by the UAS-lacZ construct is
localized in the cytoplasm.

Approximately 500 Gal4-insertion strains have been
screened and many that can be used to activate genes in
specific tissues have been identified, such as, for
example, epidermal stripes, mesoderm, the central
nervous system and the peripheral nervous system. Many
of the lines express β-galactosidase in the salivary
glands as well as in other tissues. It is possible
that in constructing the enhancerless-Gal4 transposon
a position-dependent salivary gland enhancer was
fortuitously generated.



VI. Sample Screen

# of Strains   No Staining   Salivary Gland   Other Tissues   %
                                                              5.8
45                           +                                28.8
                                              +               51.9
                                                              13.5
156                                                           100.0
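
Only two strain counts (45 and the total of 156) are legible in the sample screen, but the percentage column is complete. As a hedged consistency check, the sketch below back-calculates the counts implied by each percentage; the counts other than 45 and 156 are inferences from the arithmetic, not values read from the source.

```python
TOTAL_STRAINS = 156                       # stated in the table
PERCENTAGES = [5.8, 28.8, 51.9, 13.5]     # stated in the table, in row order


def implied_counts(total, percentages):
    """Nearest whole-number count corresponding to each percentage of total."""
    return [round(total * pct / 100) for pct in percentages]


def percentages_from(counts, total):
    """Percentages, to one decimal place, recomputed from whole counts."""
    return [round(100 * count / total, 1) for count in counts]


if __name__ == "__main__":
    counts = implied_counts(TOTAL_STRAINS, PERCENTAGES)
    print("implied counts:", counts)
    print("sum of implied counts:", sum(counts))
    print("recomputed percentages:", percentages_from(counts, TOTAL_STRAINS))
```

The second row reproduces the legible count of 45, and the inferred counts sum to the stated total, which supports the reading of the percentage column.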







To activate a gene (Gene X) in a specific pattern, 
a Gal4 insertion line is selected and crossed to a 
strain that carries Gene X cloned behind the GAL UAS. 

VII. Summary of GAL4/UAS System without the
Ribozyme

The Gal4/UAS system is a two-part system for
controlling gene activation. The method is versatile,
can be tissue-specific and does not appear to exhibit
a basal level of expression except perhaps, as
described herein, for a UAS-DTA construct. It can be
used to ectopically express characterized genes, to
express modified genes that would otherwise be lethal
to the organism and to express genes from other species
to study their effect on Drosophila development. Since
the method makes it possible to produce dominant, gain-
of-function mutations, epistasis tests and screens for
enhancers or suppressors of visible or lethal
phenotypes can be carried out. The Gal4 system also
allows the expression of toxic products to study the
consequences of cell- and tissue-specific ablation.

VIII. Use of Gal4-Expressing Drosophila with the
DTA Ribozyme of the Invention

Expression of the fusion ribozyme carrying the
sequences encoding the DTA protein was placed under the
control of the GAL4 UAS (upstream activator sequence)
in pUAST (Figure 10). As stated supra, using the modified
P-element enhancer-trap vectors described above, a
large number of stable lines of Drosophila were
constructed which each express the yeast
transcriptional activator GAL4 in specific spatial and
temporal patterns in the developing flies. Any gene
under the control of the GAL4 upstream activator
sequence (UAS) can be transformed and maintained
singly, then induced in particular Drosophila tissues
by genetic crossing to lines which express GAL4 (Figure
7). However, it was not possible to take advantage of
the Gal4 system for expression of DTA per se without
further modification, due to the difficulty in
producing UAS-DTA transformants caused by leaky
expression of the DTA.

It was found that use of this two-element system
as a means of conditionally expressing DTA via a trans-
splicing ribozyme (Figure 6) overcame these problems.
In those cells expressing GAL4, the GAL4 protein
provides the activity necessary for ribozyme
transcription, and the GAL4 mRNA provides the target
for trans-splicing necessary for DTA production.
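
The double dependence on GAL4 described above behaves like a logical AND gate: toxin is expected only where the ribozyme transgene, the GAL4 protein and the GAL4 mRNA coincide. The sketch below simply enumerates that truth table; it is a qualitative illustration with assumed names, and it deliberately ignores any leakiness or partial activity.

```python
from itertools import product


def dta_expected(gal4_protein, gal4_mrna, uas_ribozyme):
    """Qualitative model: is DTA production expected in this cell?"""
    # Transcription of the UAS-linked ribozyme requires GAL4 protein.
    ribozyme_rna = uas_ribozyme and gal4_protein
    # Trans-splicing of the DTA exon requires the GAL4 mRNA as target.
    return ribozyme_rna and gal4_mrna


def truth_table():
    """Map every (protein, mRNA, transgene) combination to the expected outcome."""
    return {state: dta_expected(*state)
            for state in product((False, True), repeat=3)}


if __name__ == "__main__":
    for (protein, mrna, transgene), outcome in truth_table().items():
        print(f"GAL4 protein={protein!s:5} GAL4 mRNA={mrna!s:5} "
              f"UAS-ribozyme={transgene!s:5} -> DTA expected: {outcome}")
```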

Drosophila embryos may be injected with ribozyme
sequences placed under the control of a UAS promoter
as described above, using techniques known in the art.
Embryos injected with the Rz-DTA met construction did
not survive, whereas normal transformed flies were
obtained from embryos injected with both Rz-DTA ile and
Rz-DTA leu. This result suggested that the internal AUG
codon was indeed acting as an initiation codon for the
translation of a toxic product after injection. The
codon is adjacent to the proposed NAD+ binding site in the
DTA sequence, and to sequences conserved in the
distantly related exotoxin A, another EF-2 specific
ADP-ribosylase from Pseudomonas aeruginosa.

Transgenic flies containing the Rz-DTA ile and Rz-
DTA leu sequences under control of the GAL4 UAS were
crossed to flies producing GAL4 in particular patterns
of expression. For example, in one characterized line,
line 1J3, the GAL4 gene had been inserted near the
hairy gene, and mirrored its pattern of expression.
The hairy gene product is produced in epidermal stripes
in the even-numbered abdominal segments during
embryogenesis. When a UAS-driven LacZ gene was
introduced into 1J3, in which GAL4 is expressed in the
same pattern as the hairy gene product, β-galactosidase
was found localized within the even-numbered stripes.
When flies containing the Rz-DTA leu gene were crossed
to this GAL4-expressing line, normal progeny resulted.
However, when flies containing Rz-DTA ile were crossed
to the GAL4-expressing line, development of the progeny
was arrested in embryogenesis. Darker colored bands
were evident on the cuticles of the embryos, consistent
with the death of underlying cells. When cuticle
preparations were examined, the even-numbered denticle
bands were disrupted or missing, particularly those of
the 4th, 6th and 8th stripes (Figure 11). Other
specific patterns of cell death were observed when
flies containing Rz-DTA ile were crossed to different
GAL4-expressing lines.

Example 5
Design of Pro-ribozymes

As a test for the design of pro-ribozymes, the
CAT-LacZ trans-splicing ribozyme described
earlier was modified (Figure 2). Phylogenetic
comparisons and mutational analysis (for review, see
Cech, Ann. Rev. Biochem. 59:543-568 (1990)) have
indicated that a core region of the group I self-
splicing introns is highly conserved and important for
activity (Figure 8). For the construction of trans-
splicing pro-ribozymes a helix immediately adjacent to
this region, P8, was disrupted. In the first
experiments, 13 or 18 nucleotides of new sequence were
introduced into the 5' strand and loop of helix P8, to
produce pro-ribozyme 1 and 2, respectively. The extra
nucleotides were complementary to the 5' "anti-sense"
portion of the ribozyme, while the flanking sequences
were adjusted to conserve (1) the actual sequences at
the base of P8, and (2) the extent of base-pairing
possible within P8 (Figure 13). The extent of self-
complementarity between the sequences inserted into
helix P8 and the 5' "anti-sense" region of the pro-
ribozyme is such that this new helix would be expected
to form in nascent transcripts, in preference to helix
P8. The formation of this alternative helix would also
be expected to disrupt flanking secondary and perhaps
tertiary interactions within the catalytic core of the
ribozyme. Thus, mis-folding of the pro-ribozyme would
render it catalytically inactive (Figure 14). However,
base-pairing of the pro-ribozyme with the intended
target RNA would displace the P8-"anti-sense" base-
pairing, sequester the "anti-sense" sequences and allow
re-formation of the P8 helix and an active catalytic
domain. Displacement of the P8-"anti-sense" helix
results in a greater sum of base-pairs and allows
proper folding of the catalytic domain, so should be
energetically favored.
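
The "greater sum of base-pairs" argument can be made concrete with a toy count. In the sketch below all sequences are hypothetical stand-ins (they are not taken from the CAT-LacZ or GAL4-DTA constructs), pairing is scored by simple antiparallel alignment without gaps, and base-pair counts are used only as a crude proxy for stability.

```python
WATSON_CRICK = {"A": "U", "U": "A", "G": "C", "C": "G"}
# Allowed RNA pairs, including the G-U wobble pair.
PAIRS = {("A", "U"), ("U", "A"), ("G", "C"), ("C", "G"), ("G", "U"), ("U", "G")}


def reverse_complement(rna):
    """Watson-Crick reverse complement of an RNA sequence."""
    return "".join(WATSON_CRICK[base] for base in reversed(rna))


def duplex_pairs(strand, partner):
    """Count paired positions when two equal-length strands align antiparallel."""
    if len(strand) != len(partner):
        raise ValueError("strands must have equal length for this simple model")
    return sum((a, b) in PAIRS for a, b in zip(strand, reversed(partner)))


# HYPOTHETICAL sequences, written 5' to 3', for illustration only.
ANTISENSE = "GGCAUCGUAAAG"                         # 5' "anti-sense" region
TARGET_SITE = reverse_complement(ANTISENSE)        # site on the target RNA
P8_INSERT = reverse_complement(ANTISENSE[:8])      # sequence inserted into P8
P8_STEM_5 = "GCGAC"                                # one strand of the P8 stem
P8_STEM_3 = reverse_complement(P8_STEM_5)          # the other strand


def inactive_state_pairs():
    """Pro-ribozyme alone: insert pairs with the anti-sense region, P8 is disrupted."""
    return duplex_pairs(ANTISENSE[:8], P8_INSERT)


def active_state_pairs():
    """Target bound: anti-sense pairs with the target and helix P8 re-forms."""
    return duplex_pairs(ANTISENSE, TARGET_SITE) + duplex_pairs(P8_STEM_5, P8_STEM_3)


if __name__ == "__main__":
    print("base-pairs without target:", inactive_state_pairs())
    print("base-pairs with target bound:", active_state_pairs())
```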

CAT-LacZ pro-ribozymes 

Cloned sequences corresponding to the two CAT-LacZ
pro-ribozymes were constructed using PCR-mutagenesis
as discussed above, and RNAs were produced by in vitro
transcription. The CAT-LacZ trans-splicing ribozyme
was observed to undergo scission during transcription
at the 3' splice junction, as a result of hydrolysis
catalyzed by the intron sequences. Similar hydrolysis
is seen in in vitro transcripts of the unmodified
Tetrahymena thermophila intron. In contrast,
transcripts of the different CAT-LacZ pro-ribozymes are
more stable, with little cleavage evident under the
same conditions (Figure 15). This indicates that the
pro-ribozymes are inactive, which would be expected if
the catalytic sequences were mis-folded. Truncated
forms of the pro-ribozymes were tested for specific
endoribonuclease activity directed against the CAT RNA.
CAT-LacZ pro-ribozyme RNAs were transcribed from
templates truncated at the ScaI site, to remove the 3'
splice junction and LacZ sequences. Both ribozyme and
pro-ribozyme RNAs are stable after removal of the 3'
splice site. Incubation of the truncated pro-ribozymes
with CAT RNA led to specific cleavage of the target RNA
to give fragments of the expected sizes (Figure 16).
Specific cleavage activity was seen at 37, 45 and 50
degrees.

Pro-ribozyme forms of the GAL4-DTA trans-splicing
ribozyme were also constructed (Figure 17). Regions
of 20 nucleotides (complementary to the "anti-sense"
region) were inserted into the 5' strand and loop of
helix P8. The two pro-ribozymes differed in the extent
of base-pairing possible in the modified helices P8,
with GAL4-DTA pro-ribozyme 1 possessing both a longer
stem and fewer (3) accessible bases in the loop. The
helix P8 of GAL4-DTA pro-ribozyme 2 more closely
resembles that of the CAT-LacZ pro-ribozyme 2, with a
larger loop (14 bases) containing sequences
complementary to the "anti-sense" region. Transcripts
of the GAL4-DTA pro-ribozymes are more stable than
those of the unmodified ribozyme. In particular, pro-
ribozyme 2 is mainly intact after incubation in
conditions that result in essentially complete self-
cleavage of the ribozyme form (30 min at 50°C, 10 mM MgCl2,
2 mM GTP, see Figure 18).

Having now fully described the invention, it will
be understood by those with skill in the art that the
invention may be performed within a wide and equivalent
range of conditions, parameters and the like, without
affecting the spirit or scope of the invention or any
embodiment thereof.

SEQUENCE LISTING



(1) GENERAL INFORMATION: 

(i) APPLICANT: Haseloff, James

Goodman, Howard M.
Brand, Andrea
Perrimon, Norbert

(ii) TITLE OF INVENTION: Cell Ablation Using Trans-Splicing Ribozymes
(iii) NUMBER OF SEQUENCES: 56

(iv) CORRESPONDENCE ADDRESS:

(A) ADDRESSEE: Sterne, Kessler, Goldstein & Fox 

(B) STREET: 1225 Connecticut Avenue, N.W., Suite 300 

(C) CITY: Washington 

(D) STATE: DC 

(E) COUNTRY: USA 

(F) ZIP: 20036 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: PatentIn Release #1.0, Version #1.25

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: PCT 

(B) FILING DATE: herewith 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 07/642,330 

(B) FILING DATE: 17-JAN-1991 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Goldstein, Jorge A 

(B) REGISTRATION NUMBER: 29,021 

(C) REFERENCE/DOCKET NUMBER: 0609.3496604 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (202)633-7533 

(B) TELEFAX: (202)833-8716 



(2) INFORMATION FOR SEQ ID NO:1:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 517 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: both

(D) TOPOLOGY: linear
(ii) MOLECULAR TYPE: DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:

TGACGCAATT CAACCAAGCG CGGGTAAACG GCGGGAGTAA CTATGACTCT 

CTAAATAGCA ATATTTACCT TTGGAGGGAA AAGTTATCAG GCATGCACCT 

CCTAGCTAGT CTTTAAACCA ATAGATTGCA TCGGTTTAAA AGGCAAGACC 

GTCAAATTGC GGGAAAGGGG TCAACAGCCG TTCAGTACCA AGTCTCAGGG 

GAAACTTTGA CATGGCCTTG CAAAGGGTAT GGTAATAAGC TGACGGACAT 

GGTCCTAACC ACGCAGCCAA GTCCTAAGTC AACAGATCTT CTGTTGATAT 

GGATGCAGTT CACAGACTAA ATGTCGGTCG GGGAAGATGT ATTCTTCTCA 

TAAGATATAG TCGGACCTCT CCTTAATGGG AGGTAGCGGA TGAATGGATG 

CAACACTGGA GCCGCTGGGA ACTAATTTGT ATGCGAAAGT ATATTGATTA 

GTTTTGGAGT ACTCGTAAGG TAGCCAAATG CCTCGTCATC TAATTAGTGA 

CGCGCATGAA TGGATTA

(2) INFORMATION FOR SEQ ID NO:2:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 82 bases
(B) TYPE: nucleic acid

(C) STRANDEDNESS: both

(D) TOPOLOGY: linear

(ii) MOLECULAR TYPE: DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2:

GGCCAAGCTT CTTTACGATG CCATTGGGAT ATATCAACCG TGGTATAAAC 
CCGTGGTTTT TAAAAGTTAT CAGGCATGCA CC 

(2) INFORMATION FOR SEQ IQ N0:3: 

(1) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 47 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULAR TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3:

GATTAGTTTT GGAGTACTCG TACGGATTCA CGGCCGTCGT TTTACAA 



(2) INFORMATION FOR SEQ ID NO:4:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULAR TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4:
GGCCGAATTC TTACAATTTC CATTCAGGCT GCGCAACTGT TGG 



(2) INFORMATION FOR SEQ ID NO:5:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 623 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULAR TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:5:
GGGAGACCGG AAGCTTCTTT ACGATGCCAT TGGGATATAT CAACGGTGGT 50
ATAAAGCCGT GGTTTTTAAA AGTTATCAGG CATGCACCTG GTAGCTAGTC 100
TTTAAACCAA TAGATTGCAT CGGTTTAAAA GGCAAGACCG TCAAATTGCG 150
GGAAAGGGGT CAACAGCCGT TCAGTACCAA GTCTCAGGGG AAACTTTGAG 200
ATGGCCTTGC AAAGGGTATG GTAATAAGCT GACGGACATG GTCCTAACCA 250
CGCAGCCAAG TCCTAAGTCA ACAGATCTTC TGTTGATATG GATGCAGTTC 300

ACAGACTAAA TGTCGGTCGG GGAAGATGTA TTCTTCTCAT AAGATATAGT 350 

CGGACCTCTC CTTAATGGGA GCTAGCGGAT GAAGTGATGC AACACTGGAG 400 

CCGCTGGGAA CTAATTTGTA TGCGAAAGTA TATTGATTAG TTTTGGAGTA 450 

CTCGTACGGA TTCACTGGCC GTCGTTTTAC AACGTCGTGA CTGGGAAAAC 500 

CCTGGCGTTA CCCAACTTAA TCGCCTTGCA GCACATCCCC CTTTCGCCAG 550 

CTGGCGTAAT AGCGAAGAGG CCCGCACCGA TCGCCCTTCC CAACAGTTGC 600

GCAGCCTGAA TGGAAATTGT AAG 623 



(2) INFORMATION FOR SEQ ID NO:6:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1038 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:6:

GTCGACCTTT TTAAGTCGGC AAATATCGCA TGTTTGTTCG ATAGACATCG AGTGGCTTCA 60 

AAAGTTATCA GGCATGCACC TGGTAGCTAG TCTTTAAACC AATAGATTGC ATCGGTTTAA 120 

AAGGCAAGAC CGTCAAATTG CGGGAAAGGG GTCAACAGCC GTTCAGTACC AAGTCTCAGG 180 

GGAAACTTTG AGATGGCCTT GCAAAGGGTA TGGTAATAAG CTGACGGACA TGGTCCTAAC 240 

CACGCAGCCA AGTCCTAAGT CAACAGATCT TCTGTTGATA TGGATGCAGT TCACAGACTA 300 

AATGTCGGTC GGGGAAGATG TATTCTTCTC ATAAGATATA GTCGGACCTC TCCTTAATGG 360 

GAGCTAGCGG ATGAAGTGAT GCAACACTGG AGCCGCTGGG AACTAATTTG TATGCGAAAG 420 

TATATTGATT AGTTTTGGAG TACTCGTCTC GATGATGTTG TTGATTCTTC TAAATCTTTT 480 

GTGATTGAAA ACTTTTCTTC GTACCACGGG AGTAAACCTG GTTATGTAGA TTCCATTCAA 540

AAAGGTATAC AAAAGCCAAA ATCTGGTACA CAAGGAAATT ATGACGATGA TTGGAAAGGG 600

TTTTATAGTA CCGACAATAA ATACGACGCT GCGGGATACT CTGTAGATAA TGAAAACCCG 660

CTCTCTGGAA AAGCTGGAGG CGTGGTCAAA GTGACGTATC CAGGACTGAC GAAGGTTCTC 720

GCACTAAAAG TGGATAATGC CGAAACTATT AAGAAAGAGT TAGGTTTAAG TCTCACTGAA 780

CCGTTGATGG AGCAAGTCGG AACGGAAGAG TTTATCAAAA GGTTCGGTGA TGGTGCTTCG 840

CGTGTAGTGC TCAGCCTTCC CTTCGCTGAG GGGAGTTCTA GCGTTGAATA TATTAATAAC 900

TGGGAACAGG CGAAAGCGTT AAGCGTAGAA CTTGAGATTA ATTTTGAAAC CCGTGGAAAA 960

CGTGGCCAAG ATGCGATGTA TGAGTATATG GCTCAAGCCT GTGCAGGAAA TCGTGTCAGG 1020

CGATCTTTGT GACTCGAG 1038



(2) INFORMATION FOR SEQ ID NO:7:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 134 bases 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULAR TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
GTTTAGTTGT TCACCTGAGT CGTGTGTTTT GTATTTTGCG TCTTAGTGTG 50
CCTATGGACA AATCTGGATC TCCCAATGCT AGTAGAACCT CCCGGCGTCG 100
TCGCCCGCGT AGAGGTTCTC GGTCCGCTTC TGGT 134



(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 134 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 



(ii) MOLECULAR TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:8:
GTTTAGTTGT TCACCTGAGT CGTGTTTTCT TTGTTTTGCG TCTCAGTGTG 50
CCTATGGACA AATCTGGATC TCCCAATGCT AGTAGAACCT CCCGGCGTCG 100 
TCGCCCGCGT AGAGGTTCTC GGTCCGCTTC TGGT 134 



(2) INFORMATION FOR SEQ ID NO:9:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 152 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULAR TYPE: DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:9:

GTTATTGTCT ACTGACTATA TAGAGAGTGT TTGTGCTGTG TTTTCTCTTT 50 

TGTGTCGTAG AATTGAGTCG AGTCATGGAC AAATCTGAAT CAACCAGTGC 100 

TGGTCGTAAC CGTCGACGTC GTCCGCGTCG TGGTTCCCGC TCCGCCCCCT 150 

CC 152 

(2) INFORMATION FOR SEQ ID NO:10:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 152 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULAR TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:10: 

GTTATTGTCT ACTGACTATA TAGAGAGTGT GTGTGCTGTG TTTTCTCTTT 50 

TGTGTCGTAG AATTGAGTCG AGTCATGGAT AAATCTGAAT CAACCAGTGC 100 

TGGTCGTAAC CGTCGACGTC GTCCGCGTCG TGGTTCCCGC TCCGCCTCCT 150 

CC 152 
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(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 131 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULAR TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:11:
AGAGAGTGTG TGTGCTGTGT TTTCTCTTTT GTGTCGTAGA ATTGAGTCGA
GTCATGGACA AATCTGAATC AACCAGTGCT GGTCGTAACC GTCGACGTCG 
TCCGCGTCGT GGTTCCCGCT CCGCCCCCTC C 



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 154 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both

(D) TOPOLOGY: linear 

(ii) MOLECULAR TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:12:
GTTATTGTCT ACTGATTGTA TAAAGAGTGT GTGTGTGCTG TGTTTTCTCT 
TTTACGTCGT AGAATTGAGT CGAGTCATGG ACAAATCTGA ATCAACCAGT 
GCTGGTCGCA ACCGTCGACG TCGTCCGCGT CGTGGTTCCC GCTCCGCCCC 
CTCC 



(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 154 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear

(ii) MOLECULAR TYPE: DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:13:

GTTATTGTCT ACTGACTATA TAGAGAGTGT GTGTGTGCTG TGTTTTCTCT 50
TTTGTGTCGT AGAATTGAGT CGAGTGATGG ACAAATCTGA ATCAACCAGT 100
GCTGGTCGTA ACCGTCGACG TCGTTTGCGT CGTGGTTCCC GCTCCGCCTC 150
CTCC 154



(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 130 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULAR TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
GAGTGTGTAT GTGCTGTGTT TTCTCTTTTG TGTCGTAGAA TTGAGTCGAG 50 
TCATGGAGAA ATCTGAATCA ACCAGTGCTG GTCGTAACCG TCGACGTCGT 100
CCGCGTCGTG GTTCCCGCTC CGCCCCCTCC 130 



(2) INFORMATION FOR SEQ ID NO:15:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 152 bases
(B) TYPE: nucleic acid

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 
(ii) MOLECULAR TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:15:

GTTATTGTCT ACTGACTATA TAGAGAGTGT GTGTGCTGTG TTTTCTCTTT 50

TGTGTCGTAG AATTGAGTCG AGTCATGGAC AAATCTGAAT CAACCAGTGC 100

TGGTCGTAAC CATCGACGTC GTCCGCGTCG TGGTTCCCGC TCCGCCCCCT 150

(2) INFORMATION FOR SEQ ID NO:16:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 76 bases 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULAR TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:16:
GGAGGGGGCG GAGCGGGAAC CACGACGCGG ACGACGTCGA CGGTTACGAC
CAGCCCTGGT AGATTCAGAT TTGTCCAT 

(2) INFORMATION FOR SEQ ID NO:17:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 49 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both
(D) TOPOLOGY: linear

(ii) MOLECULAR TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:17:

TTTGCGTCTT AGTGTGCCTA TGGACAAATC TGGATCTCCC AATGCTAGT 49 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 49 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULAR TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:18:
TTTGCGTCTC AGTGTGCCTA TGGACAAATC TGGATCTCCC AATGCTAGT 49



(2) INFORMATION FOR SEQ ID NO:19:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 56 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both

(D) TOPOLOGY: linear 

(ii) MOLECULAR TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
TTTGTGTCGT AGAATTGAGT CGAGTCATGG ACAAATCTGA ATCAACCAGT 
GCTGGT 



(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 56 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULAR TYPE: DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:20:
TTTGTGTCGT AGAATTGAGT CGAGTCATGG ATAAATCTGA ATCAACCAGT 
GCTGGT 



(2) INFORMATION FOR SEQ ID NO:21:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 56 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULAR TYPE: nucleic acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:21:
TTTGTGTCGT AGAATTGAGT CGAGTCATGG ACAAATCTGA ATCAACCAGT
GCTGGT

(2) INFORMATION FOR SEQ ID NO:22:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 56 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULAR TYPE: DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:22:
TTTACGTCGT AGAATTGAGT CGAGTCATGG ACAAATCTGA ATCAACCAGT
GCTGGT

(2) INFORMATION FOR SEQ ID NO:23:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 56 bases

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Nucleic Acid 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23:

TTTGTGTCGT AGAATTGAGT CGAGTCATGG ACAAATCTGA ATCAACCAGT 50
GCTGGT 56

(2) INFORMATION FOR SEQ ID NO:24:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 56 bases

(B) TYPE: Nucleic Acid

(C) STRANDEDNESS: Single

(D) TOPOLOGY: Linear

(ii) MOLECULAR TYPE: Nucleic Acid

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:24:
TTTGTGTCGT AGAATTGAGT CGAGTCATGG ACAAATCTGA ATCAACCAGT 50
GCTGGT 56 

(2) INFORMATION FOR SEQ ID NO:25:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 56 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULAR TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 
TTTGTGTCGT AGAATTGAGT CGAGTCATGG ACAAATCTGA ATCAACCAGT 50 
GCTGGT 56 



(2) INFORMATION FOR SEQ ID NO:26:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 59 bases 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Nucleic Acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 
AATTTTGTGT CGTAGAATTG AGTCGAGTCA TGGACAAATC TGAATCAACC 50 
AGTGCTGCA 59 



(2) INFORMATION FOR SEQ ID NO:27:

(i) SEQUENCE CHARACTERISTICS: 



WO 92/13089 



- 68 



PCT/US92/00277 



(A) LENGTH: 51 bases 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Nucleic Acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:27:
GCACTGGTTG ATTCAGATTT GTCCATGACT CGACTCAATT CTACGACACA 



50 
51 



(2) INFORMATION FOR SEQ ID NO:28:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 59 bases

(B) TYPE: Nucleic Acid

(C) STRANDEDNESS: Single

(D) TOPOLOGY: Linear

(ii) MOLECULAR TYPE: Nucleic Acid

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:28:
AATTTTGTGT CGTAGAATTG AGTGSAGTCA TGGACAAATC TGAATCAACC 50
AGTGCTGCA 59



(2) INFORMATION FOR SEQ ID NO:29:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Nucleic Acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:29:

AGCATTGGTA TCATCAGGTT TGT 23
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(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Nucleic Acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:30:
GTTGATGATG TTGTTGATTC T 21 



(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 amino acids 

(B) TYPE: Amino Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:31:

MET ASP LYS PHE ASP ASP VAL VAL ASP SER
1               5                   10



(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 bases 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single

(D) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Nucleic Acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:32:

ATGGACAAAT TTGATGATGT TGTTGATTCT 30

(2) INFORMATION FOR SEQ ID NO:33:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 59 bases 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single

(D) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Nucleic Acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:33:

AATTTTGTGT CGTAGAATTG AGTCGAGTCA TGGACAAATC TGAATCAACC 50

AGTGCTGCA 59
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(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 bases 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single

(D) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Nucleic Acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:34:
AGCCATCCTT GGTTCAG 17 



(2) INFORMATION FOR SEQ ID NO:35:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 bases 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Nucleic Acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:35:

GTAAGGGTGG ATGTT 15

(2) INFORMATION FOR SEQ ID NO:36:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:36:



MET ASP LYS SER GLU LEU ARG VAL ASP VAL 
1 5 10 



(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 30 bases 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Nucleic Acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:37:
ATGGACAAAT CTGAATTAAG GGTGGATGTT 30

(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 70 bases 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Nucleic Acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:38:
TCTCGATGAT GTTGTTGATT CTTCTAAATC TTTTGTGATG GAAAACTTTT 50
CTTCGTACCA CGGGACTAAA 70

(2) INFORMATION FOR SEQ ID NO:39:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:39:

MET GLU ASN PHE SER SER TYR HIS GLY THR LYS 
1 5 10 



(2) INFORMATION FOR SEQ ID NO:40:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 70 bases 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single

(D) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Nucleic Acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:40:

TCTCGATGAT GTTGTTGATT CTTCTAAATC TTTTGTGATT GAAAACTTTT 

CTTCGTACCA CGGGACTAAA 

(2) INFORMATION FOR SEQ ID NO:41:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 70 bases 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Nucleic Acid 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:41:

TCTCGATGAT GTTGTTGATT CTTCTAAATC TTTTGTGTTG GAAAACTTTT 50
CTTCGTACCA CGGGACTAAA 70

(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 78 bases 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Nucleic Acid

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 

ATGAAGCTTC TCGATGATGT TGTTGATTCT TCTAAATCTT TTGTGATGGA 50 
AAACTTTTCT TCGTACCACG GGACTAAA 78 



(2) INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 26 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:43:

MET LYS LEU LEU ASP ASP VAL VAL ASP SER SER LYS SER PHE VAL
1               5                   10                  15

MET GLU ASN PHE SER SER TYR HIS GLY THR LYS
                20                  25



(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 bases 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 
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(ii) MOLECULAR TYPE: Nucleic Acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 

ATGGAGAAAA AAATGACTGG ATATACCACC GTTGATATAT C 41

(2) INFORMATION FOR SEQ ID NO:45:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:45:

MET GLU LYS LYS ILE THR ASP SER LEU ALA VAL VAL LEU GLN ARG
1               5                   10                  15

ARG ASP
    17

(2) INFORMATION FOR SEQ ID NO:46:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 51 bases 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single

(D) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Nucleic Acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:46:
ATGGAGAAAA AAATTACGGA TTCACTGGCC GTCGTTTTAC AACGTCGTGA 50
C 51

(2) INFORMATION FOR SEQ ID NO:47:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 bases 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 
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(ii) MOLECULAR TYPE: Nucleic Acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:47:
ATGAAGCTAC TGTCTTCTAT CGAACAAGCA TGCGATATTT 40

(2) INFORMATION FOR SEQ ID NO: 48: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48: 

MET LYS LEU LEU ASP ASP VAL VAL ASP SER SER LYS SER PHE VAL
1               5                   10                  15

MET GLU ASN PHE SER
                20



(2) INFORMATION FOR SEQ ID NO:49:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 bases 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Nucleic Acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:49:
ATGAAGCTTC TCGATGATGT TGTTGATTCT TCTAAATCTT TTGTGATGGA 50 
AAACTTTTCT 60 



(2) INFORMATION FOR SEQ ID NO:50:



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 72 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:50:
AUGGAGAAAA AAAUCACUGG AUAUACCACC GUUGAUAUAU CCCAAUGGCA UCGUAAAGAA 60

CAUUUUGAGG CA 72

(2) INFORMATION FOR SEQ ID NO:51:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 479 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:51:

AAGCUUCUUU ACGAUGCCAU UGGGAUAUAU CAACGGUGGU AUAAAGCCGU GGUUUUUAAA 60

AGUUAUCAGG CAUGCACCUG GUAGCUAGUC UUUAAACCAA UAGAUUGCAU CGGUUUAAAA 120

GGCAAGACCG UCAAAUUGCG GGAAAGGGGU CAACAGCCGU UCAGUACCAA GUCUCAGGGG 180

AAACUUUGAG AUGGCCUUGC AAAGGGUAUG GUAAUAAGCU GACGGACAUG GUCCUAACCA 240

CGCAGCCAAG UCCUAAGUCA ACAGAUCUUC UGUUGAUAUG GAUGCAGUAC AGACUAAAUG 300

UCGGUCGGGG AAGAUGUAUU CUUCUCAUAA CAUAUAGUCG GACCUCUCCU UAAUGGGAGC 360

UAGCGGAUGA AGUGAUGCAA CACUGGAGCC GCUGGGAACU AAUUUGUAUG CGAAAGUAUA 420

UUGAUUAGUU UUGGAGUACU CGUACGGAUU CACUGGCCGU CCUGUUACAA CGUCGUGAC 479

(2) INFORMATION FOR SEQ ID NO:52:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 479 base pairs

(B) TYPE: nucleic acid

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(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:52:

AAGCUUCUUU ACGAUGCCAU UGGGAUAUAU CAACGGUGGU AUAAAGCCGU GGUUUUUAAA 60

AGUUAUCAGG CAUGCACCUG GUAGCUAGUC UUUAAACCAA UAGAUUGCAU CGGUUUAAAA 120

GGCAAGACCG UCAAAUUGCG GGAAAGGGGU CAACAGCCGU UCAGUACCAA GUCUCAGGGG 180 

AAACUUUGAG AUGGCCUUGC AAAGGGUAUG GUAAUAAGCU GACGGACAUG GUCCUAACCA 240 

CGCAGCCAAG UCCUAAGUCA ACAGAUCUUC UGUUGAUAUG GAUGCAGUAC AGACUAAAUG 300 

UCGGUCGGGG AAGAUGUAUU CUUCUCAUAA CAUAUAGUCG GACCUCUCCU UAAUGGGAGC 360 

UAGCGGAUGA AGUGAUGCAA CACUGGAGCC GCUGGGAACU AAUUUGUAUG CGAAAGUAUA 420 

UUGAUUAGUU UUGGAGUACU CGUACGGAUU CACUGGCCGU CCUGUUACAA CGUCGUGAC 479 
(2) INFORMATION FOR SEQ ID NO:53:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 480 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: RNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:53:

AAGCUUCUUU ACGAUGCCAU UGGGAUAUAU CAACGGUGGU AUAAAGCCGU GGUUUUUAAA 60

AGUUAUCAGG CAUGCACCUG GUAGCUAGUC UUUAAACCAA UAGAUUGCAU CGGUUUAAAA 120 

GGCAAGACCG UCAAAUUGCG GGAAAGGGGU CAACAGCCGU UCAGUACCAA GUCUCAGGGG 180 

AAACUUUGAG AUGGCCUUGC AAAGGGUAUG GUAAUAAGCU GACGGACAUG GUCCUAACCA 240 

CGCAGCCAAG UCCUAAGUCA ACAGAUCUUC UGUUGAUAUG GAUGCAGUAC AGACUAAAUG 300
UCGGUCGGGA CCGUUGAUAU AUGGUUCAUA ACAUAUAGUC GGACCUCUCC UUAAUGGGAG 360
CUAGCGGAUG AAGUGAUGCA ACACUGGAGC CGCUGGGAAC UAAUUUGUAU GCGAAAGUAU 420
AUUGAUUAGU UUUGGAGUAC UCGUACGGAU UCACUGGCCG UCCUGUUACA ACGUCGUGAC 480

(2) INFORMATION FOR SEQ ID NO:54:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 487 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54: 

AAGCUUCUUU ACGAUGCCAU UGGGAUAUAU CAACGGUGGU AUAAAGCCGU GGUUUUUAAA 60 

AGUUAUCAGG CAUGCACCUG GUAGCUAGUC UUUAAACCAA UAGAUUGCAU CGGUUUAAAA 120

GGCAAGACCG UCAAAUUGCG GGAAAGGGGU CAACAGCCGU UCAGUACCAA GUCUCAGGGG 180 

AAACUUUGAG AUGGCCUUGC AAAGGGUAUG GUAAUAAGUU GACGGACAUG GUCCUAACCA 240

CGCAGCCAAG UCCUAAGUCA ACAGAUCUUC UGUUGAUAUG GAUGCAGUAC AGACUAAAUG 300 

UCGGUCGGGA CCGUUGAUAU AUCCCAAACG GUUCAUAACA UAUAGUCGGA CCUCUCCUUA 360 

AUGGGAGCUA GCGGAUGAAG UGAUGCAACA CUGGAGCCGC UGGGAACUAA UUUGUAUGCG 420 

AAAGUAUAUU GAUUAGUUUU GGAGUACUCG UACGGAUUCA CUGGCCGUCC UGUUACAACG 480 
UCGUGAC 

(2) INFORMATION FOR SEQ ID NO:55:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1044 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:55:

GTCGACCTTT TTAAGTCGGC AAATATCGCA TGTTTGTTCG ATAGACATCG AGTGGCTTCA 60

AAAGTTATCA GGCATGCACC TGGTAGCTAG TCTTTAAACC AATAGATTGC ATCGGTTTAA 120

AAGGCAAGAC CGTCAAATTG CGGGAAAGGG GTCAACAGCC GTTCAGTACC AAGTCTCAGG 180 

GGAAACTTTG AGATGGCCTT GCAAAGGGTA TGGTAATAAG CTGACGGACA TGGTCCTAAC 240 

CACGCAGCCA AGTCCTAAGT CAACAGATCT TCTGTTGATA TGGATGCAGT TCACAGACTA 300 

AATGTCGGTC GGGGAACAAC ATGCGATATT GTTCTCATAA GATATAGTCG GACCTCTCCT 360 

TAATGGGAGC TAGCGGATGA AGTGATGCAA CACTGGAGCC GCTGGGAACT AATTTGTATG 420 

CGAAAGTATA TTGATTAGTT TTGGAGTACT CGTCTCGATG ATGTTGTTGA TTCTTCTAAA 480 

TCTTTTGTGA TTGAAAACTT TTCTTCGTAC CACGGGACTA AACCTGGTTA TGTAGATTCC 540 

ATTCAAAAAG GTATACAAAA GCCAAAATCT GGTACACAAG GAAATTATGA CGATGATTGG 600 

AAAGGGTTTT ATAGTACCGA CAATAAATAC GACGCTGCGG GATACTCTGT AGATAATGAA 660 

AACCCGCTCT CTGGAAAAGC TGGAGGCGTG GTCAAAGTGA CGTATCCAGG ACTGACGAAG 720 

GTTCTCGCAC TAAAAGTGGA TAATGCCGAA ACTATTAAGA AAGAGTTAGG TTTAAGTCTC 780

ACTGAACCGT TGATGGAGCA AGTCGGAACG GAAGAGTTTA TCAAAAGGTT CGGTGATGGT 840

GCTTCGCGTG TAGTGCTCAG CCTTCCCTTC GCTGAGGGGA GTTCTAGCGT TGAATATATT 900

AATAACTGGG AACAGGCGAA AGCGTTAAGC GTAGAACTTG AGATTAATTT TGAAACCCGT 960

GGAAAACGTG GCCAAGATGC GATGTATGAG TATATGGCTC AAGCCTGTGC AGGAAATCGT 1020 

GTCAGGCGAT CTTTGTGACT CGAG 1044 
(2) INFORMATION FOR SEQ ID NO:56:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1047 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:56:
GTCGACCTTT TTAAGTCGGC AAATATCGCA TGTTTGTTCG ATAGACATCG AGTGGCTTCA
AAAGTTATCA GGCATGCACC TGGTAGCTAG TCTTTAAACC AATAGATTGC ATCGGTTTAA
AAGGCAAGAC CGTCAAATTG CGGGAAAGGG GTCAACAGCC GTTCAGTACC AAGTCTCAGG
GGAAACTTTG AGATGGCCTT GCAAAGGGTA TGGTAATAAG CTGACGGACA TGGTCCTAAC
CACGCAGCCA AGTCCTAAGT CAACAGATCT TCTGTTGATA TGGATGCAGT TCACAGACTA
AATGTCGGTC GGGCAAACAT GCGATATTTG CCGTTTGTCA TAAGATATAG TCGGACCTCT
CCTTAATGGG AGCTAGCGGA TGAAGTGATG CAACACTGGA GCCGCTGGGA ACTAATTTGT
ATGCGAAAGT ATATTGATTA GTTTTGGAGT ACTCGTCTCG ATGATGTTGT TGATTCTTCT
AAATCTTTTG TGATTGAAAA CTTTTCTTCG TACCACGGGA CTAAACCTGG TTATGTAGAT
TCCATTCAAA AAGGTATACA AAAGCCAAAA TCTGGTACAC AAGGAAATTA TGACGATGAT
TGGAAAGGGT TTTATAGTAC CGACAATAAA TACGACGCTG CGGGATACTC TGTAGATAAT
GAAAACCCGC TCTCTGGAAA AGCTGGAGGC GTGGTCAAAG TGACGTATCC AGGACTGACG
AAGGTTCTCG CACTAAAAGT GGATAATGCC GAAACTATTA AGAAAGAGTT AGGTTTAAGT
CTCACTGAAC CGTTGATGGA GCAAGTCGGA ACGGAAGAGT TTATCAAAAG GTTCGGTGAT
GGTGCTTCGC GTGTAGTGCT CAGCCTTCCC TTCGCTGAGG GGAGTTCTAG CGTTGAATAT
ATTAATAACT GGGAACAGGC GAAAGCGTTA AGCGTAGAAC TTGAGATTAA TTTTGAAACC
CGTGGAAAAC GTGGCCAAGA TGCGATGTAT GAGTATATGG CTCAAGCCTG TGCAGGAAAT
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What is claimed is: 

1. A polynucleotide molecule, said molecule
encoding a trans-splicing ribozyme, the sequence of 
said ribozyme being a fusion RNA, the sequence of such 
fusion RNA comprising: 

(1) a first RNA sequence, said first RNA 
sequence being sufficient for targeting said 
ribozyme to hybridize to a target RNA, and 
(2) a second RNA sequence, said second RNA 
sequence being capable of being transferred 
colinearly into a target RNA as a result of 
the trans-splicing activity of said 
ribozyme; 

wherein expression of said polynucleotide molecule is 
operably linked to expression of a transcriptional 
activator protein, and wherein said first RNA sequence 
is a sequence that hybridizes to an RNA that encodes 
said transcriptional activator protein. 

2. The polynucleotide molecule of claim 1, 
wherein said transcriptional activator is GAL4.

3. The polynucleotide molecule of claim 1, 
wherein said second RNA sequence comprises a sequence 
that encodes a peptide toxic to the host cell. 

4. The polynucleotide molecule of claim 3, 
wherein said peptide is the DTA peptide. 

5. The polynucleotide molecule of claim 4, 
wherein said DTA peptide is a mutant peptide sequence. 

6. The polynucleotide molecule of claim 5, 
wherein said mutant peptide sequence comprises amino 
acids encoded by SEQ ID. No. 40. 
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7. The polynucleotide molecule of claim 5, 
wherein said mutant peptide sequence comprises amino
acids encoded by SEQ ID. No. 41. 

8. The polynucleotide molecule of claim 1, 
wherein said first RNA sequence is a sequence that 
hybridizes to GAL4 RNA and wherein said second RNA 
sequence is a sequence that encodes the DTA peptide.

9. The polynucleotide molecule of any one of 
claims 1-8, wherein said molecule is RNA. 

10. The polynucleotide molecule of any one of 
claims 1-8, wherein said molecule is DNA. 

11. A polynucleotide molecule comprising a
ribozyme expression cassette, said cassette being
capable of being stably inserted into the genome of a
host, and said cassette comprising the sequence of a
promoter capable of functioning in such host, operably
linked to the coding sequence of the polynucleotide of
any one of claims 1-9.

12. A host cell comprising the polynucleotide 
molecule of claim 11. 

13. The host cell of claim 12, wherein said host 
cell is a viral cell. 

14. The host cell of claim 12, wherein said host 
cell is a prokaryotic cell. 

15. The host cell of claim 12, wherein said host 
cell is a eukaryotic cell.

16. The host cell of claim 15, wherein said
eukaryotic cell is a plant cell. 

17. The host cell of claim 15, wherein said
eukaryotic cell is an animal cell. 

18. The host cell of claim 17, wherein said 
animal is Drosophila. 

19. The host cell of claim 17, wherein said 
animal is a mammal. 

20. The host cell of claim 19, wherein said 
animal is human. 

21. A method for in vitro trans-splicing, such 
method comprising the steps of: 

(1) providing the polynucleotide molecule of 
claim 9 in a trans-splicing reaction mixture, said 
polynucleotide comprising a sequence capable of 
hybridizing with a second polynucleotide; 

(2) providing said second polynucleotide to such 
reaction mixture; and 

(4) catalyzing the trans-splicing of said second 
polynucleotide under said conditions. 

22. A method for in vivo trans-splicing, said 
method comprising the steps of: 

(1) providing a polynucleotide of claim 9 to a 
host cell; 

(2) expressing said ribozyme encoded by said 
molecule in said host cell; 

(3) expressing a substrate of said ribozyme in 
said host cell; and 

(4) catalyzing the trans-splicing of said ribozyme 
with such substrate in said host cell. 
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23. A method for inactivating the activity of a 
target RNA, said method comprising: 

(1) providing the polynucleotide of claim 9 to a 
trans-splicing reaction mixture, said ribozyme 
possessing catalytic activity against a target RNA, 
said catalytic activity resulting in the inactivation 
of the functioning of said target RNA; 

(2) providing said target RNA to said mixture; and 

(3) providing conditions that allow said 
polynucleotide to express said catalytic activity. 

24. A method for providing a desired genetic 
sequence to a host cell in vivo, said method 
comprising: 

(1) providing the polynucleotide of claim 9 to 
said host cell, said polynucleotide possessing 
catalytic activity against a target RNA in said host 
cell, said ribozyme being capable of trans-splicing 
said desired genetic sequence; 

(2) providing said target RNA in said host cell; 

and

(3) providing conditions that allow said ribozyme 
to trans-splice said desired genetic sequence into the 
sequence of said target RNA. 

25. A method for cell ablation in multicellular 
plants and animals, said method comprising providing 
the polynucleotide of claim 9 to a fertilized embryonic 
host cell, said ribozyme encoding the sequence of a 
peptide toxic to said host cell and said ribozyme being 
capable of trans-splicing said sequence into a target 
in said host cell. 

26. A method for engineering male or female 
sterility in a plant, said method comprising providing 
the polynucleotide of claim 9 to a germ cell of said
species, said ribozyme being targeted to an RNA that,
when expressed as a protein, is necessary for the
fertility of said plant and results in ablation of the 
cell that expresses said protein. 

27. A method for immunizing plants against a 
plant pathogen, said method comprising transforming 
plant cells with the polynucleotide of claim 9, wherein 
said polynucleotide encodes a trans-splicing sequence 
capable of providing immunity against said pathogen to
said plant, and wherein infection of a cell from said
plant with said pathogen results in the ablation of 
said cell. 

28. The polynucleotide molecule of any one of
claims 1-7, wherein said ribozyme is a pro-ribozyme.

29. The polynucleotide molecule of claim 8,
wherein said ribozyme is a pro-ribozyme.

30. The polynucleotide molecule of claim 9,
wherein said ribozyme is a pro-ribozyme.



31. The polynucleotide molecule of claim 10,
wherein said ribozyme is a pro-ribozyme.

32. The polynucleotide molecule of claim 11, 
wherein said ribozyme is a pro-ribozyme. 

33. The host cell of claim 12, wherein said ribozyme
is a pro-ribozyme.

34. The method of claim 21, wherein said ribozyme 
is a pro-ribozyme. 



35. The method of claim 22, wherein said ribozyme
is a pro-ribozyme.

36. The method of claim 23, wherein said ribozyme 
is a pro-ribozyme. 

37. The method of claim 24, wherein said ribozyme 
is a pro-ribozyme. 

38. The method of claim 25, wherein said ribozyme 
is a pro-ribozyme. 

39. The method of claim 26, wherein said ribozyme 
is a pro-ribozyme. 

40. The method of claim 27, wherein said ribozyme
is a pro-ribozyme.

Self-splicing of Group I introns.

1. Guanosine-mediated cleavage of 5' splice site.

2. Cleavage of 3' splice site.

3. Ligation of exon segments.

4. Circularization of intron.

[Figure: numbered nucleotide sequence listing; the sequence text is not legible.]

Cucumber mosaic virus RNA 4 trans-splicing ribozymes.

Virus RNA target sequences (CMV isolates).

Oligonucleotide target sequences.

CMV RNA 4 - diphtheria toxin A-chain trans-splicing ribozymes.

met asp lys phe asp asp val val asp ser

met asp lys ser glu leu arg val asp val
AUG GAC AAA UCU GAA UUA AGG GUG GAU GUU ... 3'

P-element mediated "enhancer-trapping" for expression of
Gal4 protein.

Figure labels: Gal4 gene; P-element mediated insertion;
cellular enhancer; tissue-specific expression of Gal4 protein
(dimer); upstream activator sequences (UAS); transcriptional
activation of gene X.

"Pro-ribozyme" design.

(1). CAT-LacZ α-peptide trans-splicing ribozyme (active).

(2). CAT-LacZ α-peptide trans-splicing "pro-ribozyme" #2 (inactive).

Modification of helix P8 for pro-ribozyme construction.

Panels: wild-type; GAL4-DTA pro-ribozyme 1 (20 nt);
CAT-LacZ pro-ribozyme 1 (13 nt); CAT-LacZ pro-ribozyme 2;
GAL4-DTA pro-ribozyme 2 (20 nt).

N = complementary to ribozyme "anti-sense"